Q p t i c a I 

METWORKS 

Arc h i toctu ro and Su rviva di I ity 




Hussein T. Mouftah 
Pin-Han Ho 



Springer Science+Business Media, LLC 



OPTICAL NETWORKS 
Architecture and Survivability 




OPTICAL NETWORKS 
Architecture and Survivability 



by 



Hussein T. Mouftah 
Pin-Han Ho 

Queen’s University at Kingston 
Ontario, Canada 




SPRINGER SCIENCE+BUSINESS MEDIA, LLC 




Library of Congress Cataloging-in-Publication Data 

Mouftah, Hussein T. 

Optical networks : architecture and survivability / by Hussein T. Mouftah, Pin-Han Ho. 
p. cm. 

Includes bibliographical references and index. 

ISBN 978-1-4613-5426-0 ISBN 978-1-4615-1 169-4 (eBook) 

DOI 10.1007/978-1-4615-1169-4 

1. Optical communications. 2. Computer network architectures. 3. Computer 
networks-Reliability. I. Ho, Pin-Han, 1971-11. Title. 

TK5105.59 .M68 2002 
621.3827-dc21 

2002032122 



Copyright © 2003 by Springer Science+Business Media New York 

Originally published by Kluwer Academic Publishers in 2003 
Softcover reprint of the hardcover 1st edition 2003 

All rights reserved. No part of this work may be reproduced, stored in a retrieval 
system, or transmitted in any form or by any means, electronic, mechanical, 
photocopying, microfilming, recording, or otherwise, without written permission 
fi*om the Publisher, with the exception of any material supplied specifically for the 
purpose of being entered and executed on a computer system, for exclusive use by 
the purchaser of the work. 



Permission for books published in Europe: permissions@wkap.nl 

Permissions for books published in the United States of America: Dermissions@wkap.com 

Printed on acid-free paper. 



The Publisher offers discounts on this book for course use and bulk purchases. 
For further information, send email to <joanne.tracy@wkap.com> . 




CONTENTS 



PREFACE xi 

1 INTRODUCTION 1 

1.1 The Optical Internet HAS COME OF AGE 1 

1.2 Evolution OF THE IP Networks 2 

1.3 Design Objectives of Optical Networks 4 

1.3.1 Survivability 4 

1.3.2 Scalability 5 

1.3.3 Class of Service 6 

1.3.4 Capacity-Efficiency 8 

1.4 Summary 9 

2 CONTROL AND MANAGEMENT ARCHITECTURE 11 

2. 1 Multi-Protocol Label Switching (MPLS) 11 

2.1.1 Establishment of Label Switched Paths (LSPs) 12 

2.1.2 Packet Forwarding and Label Swapping 13 

2.1.3 Summary of the MPLS-based Control Plane 15 

2.2 Migration from the Electronic Domain to the Optical Domain 
16 

2.2 . 1 Analogies between Control Planes for LSRs and OXCs 18 

2.2.2 Heterogeneity between the Control Planes upon LSRs and OXCs. 
18 

2.2.3 Optical Extensions to the Routing and Signaling Protocols 19 

2.2.3. 1 Extension to the OSPF-TE: 20 

2.2.3.2 Extension to the RSVP-TE 21 

2.3 Generalized MPLS Architecture 22 

2.3.1 Traffic Hierarchy Defined in Generalized MPLS. 22 

2.3.2 Functional Architecture ofMG-OXCs 23 

2.3.3 Switch Descriptions 24 

2.4 Routing and Signaling Models 27 

2.4.1 Service Models 27 

2.4.1. 1 Domain Services Model 27 

2.4. 1 .2 Unified Services Model 29 

2.4.1.3 Routing Models 30 

2.5 Summary 31 

References 31 

Questions 32 

3 ROUTING AND WAVELENGTH ASSIGNMENT (RWA): OVERVIEW 
35 




VI 



3.1 Introduction 36 

3. LI Static RWA Process 36 

3.1.2 Dynamic RWA Process 37 

3.2 Constraints on Routing and Wavelength Assignment 38 

3. 2. 1 Physical Constraints 38 

3.2.2 Diversity Constraints 39 

3. 2. 3 Wavelength Continuity Constraint 40 

3.3 Network Modeling 41 

3. 3. 1 Wavelength Graph with Extra Nodes 42 

3.3.2 Simple WG with Modified Dijkstra 's Algorithm 46 

3.3.3 Shortest Path Algorithm with Wavelength Graph ( SPAWG) 49 

3.4 Dynamic Path Selection 49 

3. 4. 1 Adaptive Routing 50 

3. 4. 2 Dynamic Wavelength Assignment 53 

3.4.3 Link‘ State Coherence 55 

3.5 Static Routing and Wavelength Assignment 57 

3.5.1 RWA by Integer Linear Programming 58 

3.5.2 RWA with Heuristic Algorithms 60 

3.6 Summary 62 

References 62 

Questions 63 

4 ALGORITHMS FOR DYNAMIC ROUTING AND WAVELENGTH 
ASSIGNMENT 65 

4. 1 Network Planning Algorithms 65 

4.1.1 Weighted Network Link-State 67 

4.1.2 Planning for Fixed Alternate Routing 75 

4. 1 .2. 1 Assumptions in Network Environment Modeling 77 

4. 1 .2.2 Optimization for Alternate Paths 78 

4. 1 .2.3 Design of the Number of Alternate Paths 79 

4. 1 .2.4 A Heuristic Algorithm 86 

4. 1 .2.5 Verification of the Algorithm 87 

4.2 Routing and Wavelength Assignment with Criticality 

Avoidance 88 

4.2.1 MIRA 89 

4. 2. 2 Asynchronous Criticality Avoidance (ACA) Protocol 92 

4.2.2. 1 Inter- Arrival Planning 93 

4.2.3 RWA with Criticality Information 95 

4.2.4 Simulation 97 

4.3 Summary 103 

References 105 

Questions: 106 




Vll 

5 ROUTING AND WAVELENGTH ASSIGNMENT WITH MULTI- 
GRANULARITY OXCS 109 

5.1 Introduction 109 

5 .2 Dynamic Tunnel Allocation with Path Selection 112 

5.3 Capacity-Balanced Static Tunnel Allocation 1 14 

5.4 Simulation 121 

5.5 Summary 127 

References 127 

Questions: 127 

6 PROTECTION AND RESTORATION 129 

6. 1 Network Survivability 129 

6.2 Survivability of Ring-based Non- WDM Networks 133 

6.2.1 Automatic Protection Switching 133 

6.2.2 SONET Self Healing Ring 134 

6.3 Survivability in WDM Mesh Networks 137 

6.3.1 A Spectrum of Protection Service 138 

6.3. 1.1 Shared Risk Link Group (SRLG) Constraint 138 

6.3. 1.2 Path-Based Shared Protection 139 

6.3.1 .3 Link-Based Shared Protection 139 

6. 3. 2 SLSP Framework. 140 

6.4 Summary 143 

References 144 

Questions 144 

7 SPARE CAPACITY ALLOCATION 149 

7. 1 Principles and Objectives of Spare Capacity Allocation 150 

7.2 Span-Oriented Spare Capacity Allocation Schemes 152 

7.2.1 Node Cover. 153 

7.2.2 Span Protection 154 

7.2.3 Ring Cover 156 

7.2.4 Pre-configured Cycle 160 

7.3 Path-oriented Spare Capacity Allocation Schemes 162 

7. 3. 1 Correlation of Working Paths 163 

7.3.2 Basis of Path-oriented Spare Capacity Allocation 168 

7.3.2.1 Relaxation of Integer Constraint: 170 

7.3.2.2 Local Search 170 

1. 3.2.2) Simulated Aimealing: 171 

7.3.2.4 Successive Survivable Routing 173 

7.4 Inter- Arrival Planning 174 

7.4. 1 Link-State Coherence 1 75 

7.4.2 Correlation of Shared Risk Link Group 176 




Vlll 



7. 4. 3 Rerouting of Protection Paths 179 

7. 4. 4 Spare Capacity Re-allocation 180 

7.5 SLSP WITH Spare Capacity Re-allocation 1 82 

7. 5 . 1 Introduction 182 

7.5.2 Interleaving of the Working Paths 185 

7. 5. 3 Optimization and Approximating Optimization 189 

7.5.3. 1 Integer Programming (InP) Formulation 190 

7.5.3.2 Local Search 190 

7.5 .3 .3 Successive Survivable Routing 192 

7.5.4 Numerical Results 193 

7. 5. 5 Design for the Size of Each Subset (Mw) 194 

7. 5. 6 Performance Evaluation 196 

1.5.6.x Allocation of Protection Domains 197 

1.5. 6.2 Simulation Results 199 

7.6 Summary 203 

References 205 

Questions: 206 

8 SURVIVABLE ROUTEVG WITH DYNAMIC TRAFFIC 211 

8.1 Survivable Routing 212 

8.2 Solving the Asymmetrically Weighted Node-Disjoint Path- 

Pair Problem 213 

8.2. 1 Iterative Two-Step-Approach 215 

8. 2. 2 Suurballe ’s Algorithm 216 

8.2.3 Integer Linear Programming Formulation (ILP) 21 7 

8.2.4 Linear Relaxation 218 

8.2.4.1 Single Flow Relaxation 219 

5.2.4.2 Linear Programming Relaxation (LPR) 222 

5.2.4.2. 1 Reparable Split 222 

5.2.4.2.2 Unreparable Split 223 

8.3 Diverse Routing for Shared Protection 226 

8. 3. 1 Cost Functions 226 

8. 3. 2 Derivation of Protection Paths 228 

8.4 Dynamic Allocation of Protection Domains 23 1 

8. 4. 1 Simulations 236 

8 .4. 1 . 1 Assumptions 236 

8.4. 1 .2 Simulation Results 237 

8.5 Summary 240 

References 241 

Questions: 241 

9 OPTICAL BURST SWITCHING 247 

9. 1 Introduction to Switching Techniques 248 




IX 



9.2 Reservation Schemes 252 

9.2.1 Resource Reservation in Circuit-Switching and Packet-Switching 

Networks 253 

9.2.2 Resource Reservation in Optical Burst Switching (OBS) 256 

9.2.2.1 Just-In-Time (JIT) 257 

9.2.2.2 Channel Scheduling in JIT OBS 259 

9.2.2.3 Just-Enough-Time (JET) 261 

9.2.2.4 QoS Support with JET OBS 263 

9.3 Node Architecture 267 

9.4 Summary 271 

References 274 

Questions: 275 

APPENDIX A; LOOP-LESS K-SHORTEST PATHS ALGORITHM IN 
DIRECTED GRAPHS 279 

A.1 Introduction 279 

A.2 Yen’s Algorithm 280 

References 281 

APPENDIX B: MAXIMUM-FLOW ALGORITHM 283 

ACRONYMS 285 

SYMBOLS 289 

GLOSSARY 291 



INDEX 



,297 




PREFACE 



This book is intended for use by anyone interested in the development of 
optical Internet and Metropolitan area networks. As the topics of the book 
cover a broad spectrum of state-of-the-art optical network design and 
management schemes, the potential readers may be those working as 
research & development engineers, graduate students studying wavelength- 
routed WDM networks, and senior undergraduate students with a 
background in algorithms and networking. This book may also supplement 
readings for any graduate course that touches the topics of internetworking, 
routing, survivability, and network planning algorithms. 

We are inspired to write this book by the rapid development and advance 
in Internet technology, which has revolutionized telecommunications over 
the past two decades. It is highly possible that progress will maintain this 
surprisingly speed in the coming future. The lightening fast evolution of 
Internet makes the cutting-edge research topics a moving target all the time. 
However, some of the most basic ideas will always hold, and will provide a 
foundation upon which new research topics may arise. 

We believe that the development of algorithms and protocols for resource 
allocation must cooperate with the control architecture design so that the 
whole communication systems can be seamlessly integrated. The “optimal” 
design for network control and management is a function of many network 
environment parameters and assumptions, which may nevertheless be 
distorted from the practical situation without a thorough consideration. Many 
design decisions are based largely on heuristic judgments and tend to be a 
compromise between alternatives. It is our goal to convey these notations to 
the readers. 

Let us review the topics covered in sequence, chapter by chapter. 

Chapter 1 provides opening remarks for the book as well as brief 
descriptions of the evolution of IP networks and the Multi-Protocol Label 
Switching (MPLS) based control plane. The design objectives are outlined to 
provide a basis for further discussions later in this book. 

Chapter 2 introduces the control and management architecture for 
wavelength-routed WDM mesh networks, including the industry and 
standard setting progresses for the optical Internet. This chapter begins with 
an overview of the MPLS control plane, in which the most important issues 
that characterize the newly standardized protocol are provided. Based on the 
MPLS control plane, the Generalized MPLS control architecture is examined 
from both the signaling and switching architecture phases. Several models 
for routing and signaling are given hereafter, which define the 
interoperability of different protocol constructs. 
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Chapter 3 provides an overview of the issues surrounding routing and 
wavelength assignment (RWA), which includes the reported algorithms and 
heuristics for performing static and dynamic path selection processes. 

Chapter 4 considers more practical situations concerning dynamic routing 
and wavelength assignment problems in which constraints are imposed on 
the path selection, such as the physical, service and administrative 
constraints. To improve throughput, a couple of planning algorithms are 
included. 

Chapter 5 discusses routing and wavelength assignment algorithms for 
networks with multi-granularity traffic in the optical layer. We examine a 
hardware-based approach to implementing path selection with multi- 
granularity traffic based on the switching architecture of MG-OXCs 
provided in Chapter 2. 

Chapter 6 introduces the state-of-the-art in protection and restoration in 
the optical layer, which includes an overview of the ring-based and mesh- 
based protection schemes. Standardized restoration mechanisms such as the 
SONET Self-healing Ring and Automatic Protection Switch (APS) are 
presented. An introduction to the MPLS-based recovery is given, which 
discusses different types of design originality, including the path-based, link- 
based, and Short Leap Shared Protection. 

Chapter 7 focuses on the spare capacity allocation schemes. Under the 
assumption that all the working capacity is known to the network control 
plane, the optimal or near-optimal deployment of spare capacity in the 
optical networks can be conducted in a static manner. We also provide a 
framework for improving performance in dynamic networks using the static 
approaches. Our goal is to fit the time-consuming optimization processes 
into a dynamic network, where coimection requests arrive one after the other 
without any knowledge of future arrivals. 

Chapter 8 introduces survivable routing algorithms with a special focus 
on optimality in capacity and computation efficiency. Diverse routing 
algorithms for both dedicated and shared protection are addressed. 

Chapter 9 describes a different switching architecture from the 
wavelength-routed approach - Optical Burst Switching (OBS). The focus is 
placed on the reservation schemes that have been proposed for the OBS, 
including Just-In-Time (JIT) and Just-Enough-Time (JET). A comparison is 
conducted between different switching techniques including Optical Packet 
Switching, Lambda-Switching, and OBS. Node architectures that implement 
the OBS are also described. 




Chapter 1 



INTRODUCTION 



1.1 The Optical Internet has come of age 

The Internet has revolutionized the computer and communications world 
like nothing before, and has permanently changed the life styles of human 
beings. The Internet is committed to providing at once a world-wide 
broadcasting capability, a mechanism for information dissemination, and a 
medium for collaboration and interaction between individuals and their 
computers without regard for geographic location. As the importance of the 
Internet grows, the strategies of constructing the infrastructure become more 
critical. The design of the Internet architecture has been a focus by 
researchers in industry and academia for the past decades, and a wider 
understanding of these strategies has never been more essential than at 
present. 

The Internet is a core network, or a Wide Area Network (WAN), which 
cuts across continents and countries for the purpose of interconnecting 
hundreds, or even thousands, of small-sized networks such as Metropolitan 
Area Networks (MANs) and Local Area Networks (LANs). These small- 
sized networks are also called access networks, which upload or download 
data flows to or from the core network through a traffic grooming 
mechanism. 

The idea of constructing networks with fiber optics first appeared with 
break-throughs in opto-electronics in the early 1980s. However, most of 
them were focused on access networks such as the Fiber Distributed Data 
Interface (FDDI), as well as Synchronous Optical Network (SONET) ring 
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networks. With advances in optical technologies, (most notably dense 
wavelength-division multiplexing (DWDM)), the amount of raw bandwidth 
available on fiber optic links has increased by several orders of magnitude, 
in which a single fiber cut may interrupt a huge amount of data traffic in 
transmission. For keeping service continuity and data integrity, we need 
protection and restoration mechanisms to make the Internet survivable to any 
failine. As a result, survivability issues emerged as the prevalence of using 
optical fibers with the DWDM technology in the Internet core. The adoption 
of a survivable optical Internet is committed to reducing network operational 
cost and enabling versatile multimedia applications through the 
simplification of administration efforts and a guarantee of service continuity 
(or system reliability) during any network failure. To build the next 
generation optical Internet, a suite of interoperable protocols must be defined 
to form a control plane upon which the networks can be survivable, scalable, 
cost-efficient, and revenue-generating by provisioning multi-class 
connection services. 

In terms of interconnecting architectures, the next generation optical 
Internet is expected to be a DWDM-based mesh network with multiple fibers 
(e.g., one to several tens) connecting each pair of nodes. With this 
architecture, a new research paradigm has been brought up to where the 
network architecture designers have never been before. Since DWDM 
technology has vastly increased the bandwidth along a single fiber, we now 
need new control and management strategies to make the most use of this 
revolutionary improvement in the enabling hardware. The existing network 
control architectures, such as IP over ATM (Asynchronous Transfer Mode) 
and IP over ATM over SONET (Synchronous Optical Network), have their 
own original design premises. However, due to the overlaid functions 
between different layers, an extra amount of control efforts is taken. When 
we are planning for the next generation Optical Internet, those overlays in 
control planes should be avoided as much as possible. A new architecture for 
provisioning hundreds of terabit per-second with scalable control and 
management is desired. 

The new network control and management plane should be real-time re- 
configurable and able to on-demand provision bandwidth with class of 
service. The control protocols designed for the new architecture must be 
scalable and fault-resilient, and be able to accommodate progress in state-of- 
the-art optical technologies. 

1.2 Evolution of the BP Networks 

Since the late 1980s, router-based cores have been largely adopted by 
service provides, where traffic engineering (TE) (i.e., the manipulation of 
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traffic flow to improve network performance and satisfy specific service 
requirements), was realized by simply manipulating cost functions and link- 
states metrics of interior gateway protocol (IGP), such as Open Shortest Path 
First (OSPF) and Intermediate-System to Intermediate-System (IS-IS). With 
the increase of Internet traffic, a number of limitations to bandwidth 
provisioning and TE requirements have emerged. 

Firstly, software-based routers had the potential of becoming traffic 
bottlenecks under heavy load. However, hardware-based IP switching 
apparatus (which will be defined in the following paragraphs) are vender- 
specific and expensive. 

Secondly, static metric manipulation was not scalable and usually 
required a trial-and-error approach rather than a scientific solution to an 
increasingly complex problem. Even after the addition of TE extensions with 
dynamic metrics, such as maximum reservable bandwidth of links to the 
IGPs (TE extensions will be detailed in Chapter 2), the improvements are 
very limited. 

To overcome the problems of bandwidth limitation and TE requirements, 
Application Specific Integrated Circuit (ASIC) based switching devices had 
been developed since the early 1990s. As prices of such devices became 
cheaper, IP over ATM was largely adopted and provided benefits for the 
Internet Service Providers (ISPs) in terms of the high-speed interfaces, 
deterministic performance, and TE capability by manipulating permanent 
virtual circuit (PVC) in the ATM core networks. 

While the IP over ATM core networks initially solved problems by using 
the IP routing cores for ISPs, the expenses of increased complexity in 
overlaying IP and ATM control mechanisms stimulated the development of 
various multi-layer switching technologies, such as Toshiba’s IP-Switching 
and Cisco’s Tag-Switching. However, these multi-layer switching 
architectures are vendor-specific and have difficulties in intercoimection. 

The evolution of control and management for the IP networks began a 
new era in 1998, when Multi-Protocol Label Switching (MPLS) was 
standardized in Request For Comment (RFC) by the Internet Engineering 
Task Force (IETF). MPLS was developed based on Cisco’s Tag Switching, 
and designed to unify the vendor-specific multi-layer switching 
architectures. The control plane based on the MPLS protocol (also called an 
MPLS-based control plane) is an extension of the original MPLS protocol to 
a specific network environment (e.g., the optical domain). The MPLS-based 
control plane has become one of the most promising candidates for the next 
generation optical Internet, aimed at satisfying the explosively growing 
bandwidth demand and offering versatile multimedia applications. Due to a 
clean separation between the upper control layer and the underlying 
transportation layer, the MPLS-based control plane is intrinsically suitable 
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for performing multi-layer switching. It has been equipped with a complete 
solution for protection and failure recovery, and also famous for having a 
platform to facilitate traffic engineering. In addition, the MPLS-based 
control plane can be easily applied to multi-vendor interconnections and 
different overlay possibilities. Unlike the framework of IP over ATM in 
which two separate routing information dissemination and signaling 
mechanisms (i.e., IP and ATM forum signaling) are overlaid, the MPLS- 
based control plane is able to provide an integrated service across the IP 
layer and underlying transportation layer. Another important reason for its 
prevalence is that the adoption of the MPLS-based control plane keeps most 
of the existing TCP/IP software artifacts working, which will save “MPLS- 
based ISPs” innumerable labor hours by not requiring them to rebuild a new 
system from the ground up. The migration from packet-switched MPLS 
protocols to wavelength-routed optical networks has occurred during the last 
few years, most notably at the instigation from the standard setting institutes 
such as the IETF, the Optical Interconnection Forum (OIF), and the 
International Communication Union T1 group (ITU-Tl). 

As the trend towards the adoption of the MPLS-based control plane 
continues, the first Internet Draft of the universal version of MPLS signaling 
- Generalized MPLS (GMPLS), was introduced by IETF in the mid 2000. 
The emergence of GMPLS provides a uniform platform of control and 
management on interfaces (i.e., fiber-switching, waveband-switching, 
lambda-switching, time-division multiplexing, and packet-switching 
interfaces) for some of the most frequently used carriers of switching types 
in the next-generation optical Internet. Therefore, with the advances in 
technologies of all-optical photonic switches, the next generation Internet is 
expected to provision traffic with fine granularity, versatile multiplexing 
types, and uniformly-defined protocols with migratable software artifacts. 

1.3 Design Objectives of Optical Networks 

This section introduces foiu: important design principles and objectives 
upon which the discussions in the rest of the book are based. The issues of 
interest include survivability, scalability, class of service, and capacity- 
efficiency. 

1.3.1 Survivability 

A network is considered to be survivable if it can maintain service 
continuity to the end users during the occurrence of any failme on 
transmission media, switching devices, and protocols, by a suite of real-time 
mechanisms of protection (e.g., traffic monitoring and fault localization). 
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along with a pre-planned restoration mechanism (e.g., traffic switchover) 
from the failure within a certain amount of time. Network survivability has 
become a critical issue as the prevalence of DWDM technology in the 
optical core, by which a single fiber cut may influence a huge amount of 
bandwidth in transmission and cause service interruptions to innumerable 
end users. Typically, Internet backbone networks are overbuilt in 
comparison with average traffic volumes, in order to support fluctuations in 
traffic levels and to stay ahead of traffic growth rates. With under-utilized 
capacity in networks, the most widely recognized strategy is to find 
protection resources that are physically disjointed (or diversely routed) from 
corresponding working paths, over which affected data flow can be switched 
to the protection paths during any failure of network elements along the 
working paths. 

Network faults can be divided into four categories: Path Failure (PF), 
Path Degraded (PD), Link Failure (LF) and Link Degraded (LD). PD and 
LD are the results of Loss of Signal (LoS), in which the quality of the optical 
flow is unacceptable to lightpath terminating nodes. To cope with this type 
of failure, a pre-determined end-to-end path that is physically disjointed 
from the working path is desired. The reason is that fault localization cannot 
be conducted with respect to LoS along the intermediate optical network 
elements in an all-optical network. For protection of the LoS failure, 
preparation of spare network capacity is the most commonly adopted 
strategy. 

In the cases of PF and LF, the continuity of a link or a path is damaged 
(e.g., a fiber cut). This kind of failure can be detected by a Loss of Light 
(LoL) detection performed at each optical network element so that fault 
localization can be easily performed. In general, all nodes are assumed to be 
capable of detecting an LoL fault in the optical layer, which can be 
performed via a tapping mechanism in output port of a switch. In such cases, 
an optical detector residing in an optical amplifier at each port of a node 
monitors power levels of all outgoing fibers. An alarming mechanism is 
executed at the underlying optical layer to inform the upper control layer of a 
failure once a power level abnormality has been detected. 

1.3.2 Scalability 

In the network with static traffic, where all traffic demand is defined 
prior to network operation, the issue of scalability refers to the size of the 
network (i.e., the number of nodes and links in the network). An 
optimization process for deploying working capacity is most likely 
performed at the network planning stage, and the design objective for 
routing and signaling protocols in such networks should lie on the 
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improvement of capacity-efficiency. For a network with dynamic traffic, 
where connection/disconnection requests, or network events, arrive at the 
network one-by-one without any prior knowledge of future arrivals, the 
scalability issues are not only limited to the size of the network, but are also 
subject to the characteristics of the traffic pattern. The important factors 
include Traffic dynamicity, Traffic granularity, and Provisioning speed. The 
definitions of the above three characteristics of dynamic traffic are important 
to understand. 

Traffic dynamicity refers to the average inter-arrival time period between 
two consecutive network events. The larger the traffic dynamicity, the 
shorter the time duration between two consecutive network events. 

Traffic granularity: An optical network has to define the traffic 
granularity before bandwidth can be provisioned. The most commonly used 
scaling system is OC-n. For example, OC-3 stands for the transmission 
speed of 155.52 Mbps, and OC-12 is 4 X 155.52 = 622.08 Mbps. When the 
network control considers smaller traffic granularity in the optical layer, the 
number of paths in networks is getting larger, thus the traffic dynamicity is 
getting larger. A lightpath in the optical layer can most likely accommodate 
Wdwidth from OC-12 to OC-768, which depends on the type of the 
network. For a Wide Area Network in which the transmission is long-distant 
and the bandwidth provisioning is large across countries or continents, a 
larger granularity of a lightpath is preferred for achieving cost-efficiency. On 
the other hand, in a Metropolitan or Local Area Network, smaller granularity 
of traffic is handled due to the fact that the traffic may be more dynamic and 
changing from time to time. 

Provisioning Speed refers to how fast a lightpath is provisioned after a 
connection request is issued. The efficiency of routing algorithms must scale 
with the requirement of the provisioning speed of coimection requests. 

1.3.3 Class of Service 

In the network optical layer, evaluating Quality of Service for a 
connection request is no longer limited to delay, jitter, or packet-discard 
policies which play major roles in differentiating services in traditional 
packet-switching networks. In general, there have been two criteria (or 
metrics) defined in services of bandwidth provisioning in the optical domain, 
namely Provisioning priority and Restoration time. The development of a 
scalable management upon these two QoS metrics is an on-going progress 
and will be a major focus in this book. We give them each a brief 
introduction in the following paragraphs. 

Provisioning Priority: Provisioning priority has been one of the most 
critical issues in routing and wavelength assignment processes for both static 
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and dynamic networks. Provisioning priority can be treated as a QoS metric 
in the optical layer because the successful setup of a lightpath implies an 
immediate service available to a group of end users with the same (or 
similar) QoS requirement, hmumerate research efforts have been made for 
equipping the optical layer with the capability of swiftly provisioning 
lightpaths. According to nowadays technology, a queue-length-sensitive 
automatic provisioning of lightpaths is still a great challenge, where the 
requirement that tens of milli-seconds of provisioning time can be obtained. 
One of the difficulties is that a routing decision is usually made after 
consuming a relatively long period of time compared with the provisioning 
latency. Approaches for routing and wavelength assignment based on 
dynamic per-wavelength link-state may take more than tens of milli-seconds, 
and are not feasible in this case. In addition, problems in the scalability of 
dissemination and a stale link-state phenomenon brought by the 
dissemination also deter the progress of research advance. 

In order to distinguish commercial importance of each lightpath, the 
connection requests with lower classes of service should suffer a greater 
chance of blocking. When a contention occurs, lower-prioritized requests 
should be buffered for the next available lightpath, or merged into the other 
ligthtpath of the same class of service. On the other hand, the connection 
requests with higher classes of service should have a smaller risk of being 
blocked for acquiring service right after a request is issued. The differences 
between each class are mainly sourced from the two phases of signaling and 
service priority. In the signaling phase, the resource reservation protocols, 
such as the Resource Reservation Protocol (RSVP) or the Constraint-based 
Label Distribution Protocol (CR-LDP), support the preemption and holding 
mechanisms, and can resolve contentions between two connection requests 
that intend to reserve network resources in common. In the service phase, on 
the other hand, even if a lower class lightpath has been set up, a higher class 
lightpath should be able to interrupt the lower class one to guard itself fi-om 
an immediate blocking. 

Restoration Time: Restoration time influences the service continuity 
and data integrity to the end users while any network element is subject to 
failure, and is a QoS metric for bandwidth provisioning in case the 
survivability is equipped. We need to justify the following design principle 
in different types of networks when manipulating this metric. The factors 
influencing the restoration speed include the protection mechanisms adopted 
(e.g., pre-planned versus real-time searching), the way of arranging spare 
capacity (e.g., dedicated versus shared), and the survivable routing strategy 
adopted (e.g., path-based, link-based or Short Leap Shared Protection). 
This book is intended to distinguish these protection and restoration 
strategies thoroughly. 
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Restoration speed should also be distinguished according to the service 
types. Applications with stringent service continuity requirements, such as 
hitemet computing, on-line banking, and transactions on commercial 
commodities, need to be restorable from a failure within a very short period 
of time to guarantee data integrity. A long restoration time to these 
applications may severely diminish the effects of restoration. On the other 
hand, some applications can tolerate a longer restoration time and a larger 
amount of data loss, such as Internet phone, Internet TV, and Internet 
conferencing, which need only a best-effort restoration service. The 
restoration speed for a specific connection request should be considered in 
the billing policies which depend on how much the end-user are charged. 

In terms of the operation economics of ISPs, a comparison between Wide 
Area Networks and Metropolitan Area Network highlight the differing 
emphasis on the design objective of restoration speed. For the Wide Area 
Networks, the bandwidth along a fiber can be up to a hundred times larger 
than that in the Metropolitan-Area Networks. Thus, a dedicated protection is 
preferred along most of the physical conduits for signaling simplicity, and 
also to ensure the service continuity with the fastest restoration possible to 
millions of end users. Typically, the Wide-Area Networks are also sparsely 
meshed; as a result, a design on sharing of spare capacity among different 
working paths may yield little effect. Due to the fact that the Metropolitan- 
Area Networks are usually densely meshed, the sharing of protection paths 
can bring a significant improvement in throughput. Since there are relatively 
much less end users corresponding to a node, and a small amount of data 
flow in a fiber as well as a shorter distance between source and destination, 
the TCP error correction and buffering mechanisms can stand for a longer 
duration of time without losing any data after the occurrence of a failure. 

From this point of view, a framework of class of service for the optical 
networks can be defined with the provisioning priority and restoration speed, 
which is the first step to setup a network with a multi-service enviromnent. 

1.3.4 Capacity-Efficiency 

One of the most important ways by which ISPs can increase revenue 
from their Internet services is by accommodating more coimection requests 
into their networks at a moment. In general, the more bandwidth that is 
provisioned, the greater the revenue is generated. (It is not certainly true if 
some connections are much more valuable than some others). Capacity- 
efficiency is defined as a measurement of effective bandwidth that a network 
can provision at a moment. 

The capacity-efficiency of an optical network can be evaluated 
alternatively with a couple of performance indexes. The choice of the 
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performance index strongly depends on the network enviromnent and design 
objectives. The performance indexes could be throughput, which is defined 
as the amount of bandwidth provisioned within a given period of time. If we 
intend to evaluate the capacity-efficiency in equipping networks with 
survivability, a performance metric may be defined as the amount of spare 
capacity required to achieve a full restorability for all working capacity, or 
the amount of working capacity to be protected with a specific arrangement 
of spare capacity. The metric of throughput is most likely used in a network 
with static traffic, in which bandwidth demand between each source-to- 
destination pair is given prior to the operation of the network. 

For a network with dynamic traffic, blocking probability usually serves 
as a performance index for evaluating the performance. Blocking probability 
is defined as the probability for a connection request to be rejected under a 
certain arrival and departure rates of network events. In the following 
chapters, we will largely adopt blocking probability as the performance 
index to analyze different control and management schemes in networks 
with dynamic traffic. 

1.4 Summary 

This chapter has provided a short description of the on-going progress for 
the control and management in the optical networks. The first two sections 
described the evolution of IP networks and sketched what the next 
generation optical Internet may look like according to the evolutionary locus. 
The MPLS-based control plane was then annotated as the most promising 
control architecture for the future optical networks, and will be taken as the 
basis for the discussions in the rest of the book. The third section presented 
several design principles for an optical network, which included the issues of 
survivability, scalability, class of service, and capacity-efficiency. With the 
framework provided in this introductory chapter, the following chapters 
discuss in detail a variety of topics in design of control plane for the WDM 
optical networks. 




Chapter 2 



CONTROL AND MANAGEMENT 
ARCHITECTURE 



This chapter covers the topics of control and management architectures 
on the core of the optical Internet. The chapter starts with an introduction of 
Multi-Protocol Label Switching (MPLS) protocol, which has been 
recognized as the most promising framework for the next-generation 
Internet. To accommodate some special characteristics in dealing with traffic 
flows in the optical domain (such as optical physics and some strong 
domain-specific requirements for networks with different technologies), a 
migration of the MPLS protocols to the optical domain has been a research 
focus of much interest in the past years. The efforts are summarized as 
Generalized MPLS (GMPLS) [6] signaling protocols in Internet Draft by 
Internet Engineering Task Force (IETF). In this chapter, an overview on the 
GMPLS protocols is given, which includes switching architecture and 
elementary functional blocks for building an optical core for the next 
generation Internet. This chapter also addresses issues of multi-granularity 
OXCs (MG-OXCs) that cooperate with the GMPLS switching architecture 
and signaling mechanisms. 

2.1 Multi-Protocol Label Switching (MPLS) 

MPLS finalizes the evolution of multi-layer switching of Internet 
technology, and is built upon the efforts of various proprietary multi-layer 
switching solutions. MPLS is famous for its strong functionality in traffic 
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engineering and class of service, and can provide a robust and scalable 
platform for facilitating control and management on the Internet. We will 
introduce in this section the MPLS protocol [1,2]. 

2.1.1 Establishment of Label Switched Paths (LSPs) 

A network domain that accommodates MPLS routing and signaling 
protocols is called an MPLS domain. The paths set up by the MPLS 
signaling mechanisms are called Label Switched Paths (LSPs), through 
which MPLS packets (i.e., IP packets cast with an MPLS header) can be 
forwarded according to MPLS forwarding and Label Swapping protocols 
(which will be introduced in Section 2.1.2). Instead of using the data-driven 
mechanism adopted by most connectionless packet-switching networks, an 
MPLS domain provides coimection-oriented traffic streams. Resource 
reservation and setup of LSPs are performed through a suite of existing IP 
protocols with Traffic Engineering (TE) extensions: based on the Resource 
Reservation Protocol (RSVP) and the Constrain-based Label Distribution 
Protocol (CR-LDP). LSPs are simplex in nature (i.e., a data flow is in one 
direction from the source node to the destination in the MPLS domain). 
Duplex traffic requires two LSPs, one LSP to carry traffic in each direction. 
An LSP is created by concatenating one or more Label Switch Routers 
(LSRs), which is a router equipped with the existing IP protocols with TE 
extensions as well as the MPLS-based forwarding plane. 

An illustration of the MPLS domain with LSRs and IP routers is shown 
in Fig. 2.1. An LSP is functionally equivalent to a virtual circuit with 
specific bandwidth, and is defined as an ingress-to-egress path through a 
network. Along an LSP, all packets assigned to specific forwarding 
equivalent classes (FECs) are transmitted. Note that an FEC is assigned to an 
MPLS packet by an egress node (i.e., LSR A or C in Fig. 2.1) to a group of 
sessions along the corresponding LSP with similar class of service 
requirements. It can also be a set of packets that are forwarded over the same 
LSP through a network even if their ultimate destinations are different. For 
example, in conventional longest-match IP routing, the set of uni-cast 
packets whose destination addresses map to a given IP address prefix is an 
example of an FEC. 

The first and last LSR along an LSP is called ingress and egress LSR of 
the LSP, respectively. The process of LSP setup and label distribution are 
performed after the decision of path selection is made in the egress node if 
source routing is adopted. For setting up an LSP, a label is distributed to 
each LSR along the LSP. With RSVP, the PATH message is propagated 
from soiurce to potential recipient(s). The receiver interested in 
communicating with the source sends the RESV messages for the 
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acknowledgement purpose. With CR-LDP, a Label Request message is used 
by an upstream LSR to request a label binding from the downstream LSR for 
a specific forwarding equivalency class (FEC) and CR-LDP. In both cases, a 
“soft state” is kept in each LSR for an LSP, which means that the LSP is 
kept active through a constant probing between each neighboring LSRs 
along the LSP, and is tom down until the probing stops. In other words, no 
specific signaling message is devised to tear down an LSP. 



IP Domain MPLS Domain IP Domain 




Fig. 2.1. An illustration of the MPLS domain with LSRs and IP routers. 

2.1.2 Packet Forwarding and Label Swapping 

The forwarding component of an LSR supports hardware-based packet 
forwarding and label swapping mechanisms. In the MPLS domain, LSRs 
may ignore the packet's IP layer header and simply forward the packet using 
the label-swapping algorithm. A label can be an integer number, and is 
carried by an MPLS packet in the header, by which the packet is switched in 
an LSR to a pre-determined output port to follow the LSP. When a labeled 
packet arrives at a switch, the forwarding component uses the input port 
number and the label of the packet to perform an exact match search of its 
forwarding table. When a match is hit, the forwarding component retrieves 
the outgoing label and the outgoing port number from the forwarding table. 
The forwarding component then swaps (or replaces) the incoming label with 
the outgoing label and directs the packet to the output port for transmission 
to the next hop in the LSP. When the labeled packet arrives at the egress 
LSR, the forwarding component still searches its forwarding table. If the 
next hop is not an LSR, the egress LSR discards the label and forwards the 
packet using conventional longest-match IP forwarding. 

The mapping of (input port number, label of this LSR) to (output port 
number, label of downstream LSR} specified in the forwarding table created 
when the LSP is set up ensures the packets with the same label from an input 
port be forwarded to the same output port of the LSR with a new label. Since 
a packet may be assigned different labels corresponding to different LSRs 
along an LSP, the forwarding plane in each LSR along the LSP has to swap 
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the label of the packet so that it will be forwarded to the correct output port 
in the downstream neighbor LSR of the LSP. The above mechanism is the 
same as the approaches used to forwarded data in ATM and Frame Relay 
switches, which not only guarantees the connection-orientated attribute of 
the traffic stream, but also makes the label distribution more flexible in 
networks with multiple LSPs. 

To summarize the role of LSRs in the MPLS domain, an LSR is 
composed of two distinct functional components; a control component and a 
forwarding component. The control component uses standard routing 
protocols, such as the Open Shortest Path First (OSPF), IS-IS (Intermediate 
System-Intermediate System), and BGP4 (Border Gateway Protocol version 
4), to exchange information with other routers to build and maintain a 
forwarding table. The control component is also equipped with signaling 
protocol devices for handling requests to set up lightpaths, such as RSVP or 
CR-LDP. The forwarding component, on the other hand, is usually 
hardware-based and executes packet forwarding and label swapping for each 
MPLS packet in switching fabric of an LSR. 

Due to the fact that the packets with the same label are forwarded to the 
same output port in an LSR, the connection-orientated characteristic of 
MPLS traffic stream is guaranteed, in which the MPLS packets are 
transported in a fixed order and follow the same physical route. The above 
mechanism distinguishes the MPLS packet forwarding from the 
connectionless characteristic of IP packets such that quality-of-service can 
be stressed. Besides, instead of using the longest match of IP address to route 
packets in the traditional IP domain all the way, an LSR forwards an MPLS 
packet by inspecting the label in the packet’s header. As a result, the 
processing delay in transporting the packet in the MPLS domain is 
significantly reduced by using the hardware-based forwarding components. 
The above two characteristics yield a clean separation between the control 
and forwarding (or transportation) planes, which are among the most 
important reasons for favoring the MPLS protocols. By completely 
separating the control component from the forwarding component, both 
functional components can be independently developed and modified. The 
control component communicates with the forwarding component by 
manipulating the packet-forwarding table. 

An MPLS packet has a 32-bit header carried to identify its FEC as shown 
in Fig. 2.2. A label is analogous to a connection identifier, which plays a 
similar role to an ATM Virtual Path IdentifierA^irtual Circuit Identifier 
(VPWCI) or a Frame Relay Data Link Connection Identifier (DLCI) in that 
the header has only node-local significance. A label is also encoded with 
routing information specified in the network layer header, which maps the 
packet to a specific FEC. 
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Fig. 2.2. An illustration of IP packet that contains MPLS header. 

In Fig. 2.2, a 32-bit MPLS header contains the following 4 fields: 

1 . The label field (20-bits) carries the value of MPLS label, which can be 
ranged from 0 to 2“ - 1 . 

2. The Class of Service (CoS) field (3-bits) determines the queuing and 
discard policies applied to the packet as it is transmitted through an 
MPLS domain. 

3. The Stack (S) field (1-bit) shows whether the label is at the top of 
hierarchy. 

4. The TTL (time-to-live) field (8-bits) provides conventional IP TTL 
functionality. 

It is clear that an MPLS packet is nothing but an IP packet masked with 
an MPLS header, which generates a “shim layer” [2] between IP control and 
transportation plane. Different from the IP over ATM, the MPLS shim layer 
does not require the IP packets to be disassembled or assembled, and can 
further achieve cost-efficiency in hardware implementation. 

2.1.3 Summary of the MPLS-based Control Plane 

The most important benefits of adopting the MPLS control plane are 
summarized as the provision of a shim layer between the IP and the 
underlying data transportation layers, and the label swapping algorithm that 
makes the setup of LSPs scalable. Because of the MPLS shim layer, most 
existing TCP/IP software artifacts can still work in routers. In addition, it 
integrates control and management of the upper control plane and the lower 
forwarding plane so that the logical and physical network topology no longer 
need to be taken as two design spaces. The integration has simplified the 
routing information exchange and signaling design. MPLS has been 
concluded as an effective solution to the stringent requirements for traffic 
engineering and class of service in the packet-switching core networks. 

A functional diagram of an LSR is shown in Fig. 2.3. The IP/MPLS 
control box is equipped with the MPLS control plane, which contains the 
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following three function units. First, the Constraint-based Shortest Path First 
(CSPF) path selection module can perform path selection (or routing) upon 
different constraints such as maximum reservable bandwidth, wavelength 
availability, and diversity requirement of a link according to the TE database 
and Shared Risk Link Group (SRLG) information. Second, the extended 
RSVP or CR-LDP signaling component is activated to send a Path Message 
or an LDP Label Request Message to set up a path based on the calculation 
result from the CSPF path selection. In addition, it can also respond to the 
requests from the other LSRs for establishing connections. Third, there is an 
ISIS/OSPF routing protocol module with ISIS/OSPF optical and TE 
extension for the dynamic dissemination of link-states and TE metrics. The 
ISIS/OSPF routing module disseminates/acquLres topological and dynamic 
link-states with the other LSRs in the MPLS domain for verifying/updating 
the information in their link-state and TE database. 
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Fig. 2.3. Functional diagram of nodes with IP/MPLS control plane and 
GMPLS signaling mechanism. 

2.2 Migration from the Electronic Domain to the Optical 
Domain 

Due to some similarities between an LSP and a lightpath (including the 
connection-oriented characteristics of transmission and in-band switching), 
people have been motivated to migrate the MPLS control plane to the 
wavelength-routed optical networks by equipping the MPLS protocol with 
optical extensions. The efforts to extend the existing IP and MPLS protocols 
have been conducted by the industrial standard setting institutes, such as 
IETF and the Optical Interconnection Forum (OIF). With the optical 
extensions, an MPLS-based control plane is defined as the integration of the 
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existing IP and MPLS protocols with traffic engineering and optical 
extensions, which adopts the Generalized MPLS (GMPLS) signaling 
mechanisms as intercormection media of control and management. Here we 
define a router in the optical domain as the Optical Cross Connect (OXC), 
which contains an upper control plane (or an IP/MPLS control box) and the 
underlining Optical Layer Cross Connect (OLXC). 

To define the roles and relationship between each function module 
provided by a router in networks, we introduce a layered interconnection 
service model as a basis for the following discussion. As shown in Fig. 2.4, 
the IP over WDM architecture is adopted, where the control and data planes 
are separated by a control-to-transport interface (CTI). In the upper control 
plane is the IP/MPLS control box, which contains four sub-layers. The top 
sub-layer is the IP layer that could be composed of the existing IP and TCP 
software artifacts. The next MPLS layer provides primitives and service 
entities between the upper IP layer and the underlying routing/protection 
layer responsible for the implementation of constraint-based routing and 
MPLS-based protection. To achieve real-time and automatic provisioning 
working capacity and reconfiguration of spare resources, the bottom of the 
foin layers contains a dedicated control network, through which the 
signaling messages can be exchanged among nodes. CTI provides primitives 
such as “coimect”, “disconnect”, “bridge”, “alarm”, etc, to the upper control 
layer. Controllers at both TUs and the IP/MPLS control box are needed to 
make TUs reconfigurable. 
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The targets of the migration lie on the following two facets. The first is to 
realize the similarities between the control mechanisms over the MPLS 
forwarding plane and the OLXCs. The second facet is to extend the existing 
IP and MPLS protocols to cover some ubiquitous characteristics in the 
optical domain. The two facets are presented in the following context 
through Section 2.2.1 to Section 2.2.3. 

2.2.1 Analogies between Control Planes for LSRs and OXCs 

The migration of the MPLS control plane onto the optical domain is 
based on a few similarities between the control of the MPLS forwarding 
component (or a Digital Cross Connect (DXC) in the following context) and 
the OLXCs. The analogies are summarized as follows. 

1. Label versus Wavelength: A wavelength in a lightpath can play a similar 
role to a label in LSP in some sense. Each OLXC routes the optical flow 
(in wavelength) according to <(input port, wavelength), (output port, 
wavelength)> pair in DWDM networks, which performs analogously the 
label swapping mechanism in the MPLS networks. The adoption of 
wavelength conversion in a node may increase the dimension of the label. 

2. Both the MPLS domain and the optical domain attempt to de-couple the 
control plane and data plane. 

3. Both LSRs and OXCs have similar objectives in the aspects of control 
and management. The requirements in the elementary hinctional blocks 
are the same, such as the resource discovery, distributed routing and 
signaling mechanisms as well as working path maintenance and 
management. To be more specific, the control plane of LSRs is used to 
discover, distribute, and maintain relevant state information associated 
with the MPLS networks, and to instantiate and maintain LSPs under 
various MPLS TE rules and policies. The control plane of the OXCs, on 
the other hand, is used to discover, distribute, and maintain relevant state 
information associated with the optical transport layer, and to establish 
optical channel trails under various optical internetworking TE rules and 
policies. Because of the similarities, the migration of control plane on 
LSRs to OXCs can be performed with less effort. 

2.2.2 Heterogeneity between the Control Planes upon LSRs and 
OXCs 

Despite of the above analogies between the control planes of LSRs and 
OXCs, there are also several aspects that distinguish the control planes on 
the DXCs and OLXCs. These aspects are the reasons why the extensions to 
the existing IP protocols are needed. 
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LSPs are quite different from lightpaths in that LSPs are cheap while 
lightpaths are expensive. A label for an MPLS packet is 20-bit, which could 
have as many as 10E6 combinations. To generate a new label (or an FEC) is 
nothing but the consumption of a trivial amount of memory space in the 
LSRs along the LSP as well as some signaling efforts. To build up an optical 
channel trail, however, means much more than memory space and signaling 
efforts. Wavelengths are expensive since a fiber may only contain a few 
hundreds of channels distinguishable to the optical devices from the latest 
report. The configuration of the optical devices takes time in the order of 
milliseconds, and is much slower than the setup of the forwarding table in an 
LSR. 

Optical traffic cannot be digitally processed based on the state-of-the-art 
technology, which behaves as one of the intrinsic difficulties of control over 
the optical domain. LSRs can provide cheap monitoring mechanisms on 
LSPs while OXCs cannot monitor the optical signal without optical-to- 
electronic-to-optical (O/E/0) conversion. The O/E/0 conversion consumes 
transceivers, and represents an expensive resource in optical networks. In 
addition, an optical data stream caimot be delayed or buffered except by 
using fiber delay lines (FDLs), which are loops of fiber with fixed lengths 
dedicated to delaying the optical traffic for a fixed amount of time. In 
general, FDLs can only delay optical traffic for a very short time (i.e., 
several tens ns) while with relatively high cost. 

2.23 Optical Extensions to the Routing and Signaling Protocols 

Extensions to the existing IP protocols, such as OSPF-TE [12], RSVP- 
TE, and ISIS-TE [13], as well as the newly designed signaling protocol, CR- 
LDP, all emerged in the early 1998, and have since served as the routing and 
signaling protocols for the MPLS-based packet-switched networks. With 
these extensions, some dynamic metrics are added to the routing protocols 
OSPF and ISIS; and some enhanced functionality of label request, label 
distribution, notification and LSP maintenance are added to the RSVP. To 
know about the optical extensions to these protocols, readers are strongly 
encouraged to derive background knowledge of the original versions of the 
protocols. This subsection, however, is aimed only at the extensions of these 
protocols rather than the operation of the protocols themselves. Due to the 
similarities of purpose between OSPF-TE and ISIS-TE, RSVP-TE and CR- 
LDP, the following paragraphs present some important optical extensions to 
OSPF-TE and RSVP-TE. The generalized-MPLS (GMPLS) signaling 
architecture is supported by the TE and OPT extensions. All the existing IP 
protocols along with the extensions are concluded into the GMPLS signaling 
defined by IETF. 
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2.2.3.1 Extension to the OSPF-TE: 

The optical extensions to the OSPF-TE to support GMPLS can be 
summarized into the following two categories: TE Link advertisements and 
Routing enhancements [13]. 

Traditionally, a TE link is advertised as an adjunct of a "regular" OSPF 
link. Optical extension challenges this notion in three ways. First, links that 
are not capable of sending and receiving on a packet-by-packet basis may 
not have TE properties. However, an OSPF adjacency (which is a link or a 
group of links advertised in the OSPF link-state advertisements (LSAs) as a 
“link”) cannot be brought up on such links. In other words, an OSPF 
adjacency cannot include the l inks that accommodate wavelength routed 
optical data stream, since the traditionally defined TE properties (e.g., the 
maximum reservable bandwidth, etc.) on these links are invalid. Second, an 
LSR can be advertised as a point-to-point TE link; thus, an advertised TE 
link need no longer be between two OSPF neighbors. This modification is 
for the purpose that an LSR may not be brought up as a regular OSPF node, 
which is instead assigned the status of a TE hnk. Finally, a number of links 
may be advertised as a single TE link (perhaps for improved scalability), and 
consequently there is no one-to-one association of a regular adjacency and a 
TE link any more. Therefore, a general notion of a TE link is derived. When 
an LSR knows that a TE link is up, and can determine the TE link's 
properties, the LSR may then advertise that link to its OSPF neighbors using 
the TE Type/LeveWalue Triplets (TLVs). All the advertisements should be 
transported on the control network that is defined as a dedicated IP network. 
The control networks are not supposed to transport any data. 

The routing enhancements to the TE properties of optical TE hnks that 
can be announced in OSPF traffic TE LSAs have been defined by the IETF. 
Specifically, the following sub-TLVs are added: 

1. Outgoing Interface Identifier: A link from LSR A to LSR B may be 
assigned an "outgoing interface identifier". This identifier is a non-zero 
32-bit number assigned by LSR A. 

2. Incoming Interface Identifier: Suppose there is a link L fi:om A to B. Let 
link L' from B to A corresponding to the same interface as L has been 
assigned an outgoing interface identifier by B. The "incoming interface 
identifier" for L (from A's point of view) is defined as the outgoing 
interface identifier for L' (from B's point of view). 

3. Maximum LSP Bandwidth: The Maximum LSP Bandwidth takes the 
place of the Maximum Link Bandwidth in the original OSPF-TE (traffic 
engineering extension). 

4. Link Protection Type: The Link Protection Type sub-TLV represents the 
protection capability that exists on a link. It is desirable to carry this 
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information so that it may be used by the path computation algorithm to 
set up LSPs with appropriate protection characteristics. 

5. Link Descriptor: The Link Descriptor TLV represents the characteristics 
of the link, in particular the link type and the bit transmission rate. These 
characteristics represent one of the constraints on the type of LSPs that 
could be carried over the link. 

6. Shared Risk Link Group: A set of links may constitute a SRLG if they 
share a resource whose failure may affect all links in the set. For 
example, two fibers in the same conduit would be in the same SRLG. A 
link may belong to multiple SRLGs, and thus the SRLG TLV describes a 
list of SRLGs to which the link belongs. An SRLG is identified by a 32- 
bit number that is unique within an IGP domain. 

2.23.2 Extension to the RSVP-TE 

The extensions to the RSVP-TE to support GMPLS signaling can be 
divided into three categories: label related format, bi-directional path setup, 
and notification messages [14]. 

The extension efforts define formats for a generalized label request, a 
generalized label, a support for waveband switching, suggested label and 
label sets. In a network environment accommodating several multiplexing 
and switching techniques, new label formats and requests are necessary to 
make every steps as specific as possible. A support for the circuit-switched 
type carriers (e.g., fiber switching and lambda switching) needs new 
definitions of labels. The waveband switching provides a spectrum of 
switching quantity between fiber switching and wavelength switching, where 
a band of wavelengths is switched (e.g., 10 consecutive wavelengths) rather 
than a whole fiber or a single wavelength. 

A bi-directional path with each direction is required to be in the same 
physical links, which has been recognized as more efficient and cost- 
effective in the optical networks. The RSVP-TE has been extended so that a 
bi-directional lightpath can be set up in a move. The RSVP-TE also provides 
mechanisms for resolving contention when two or more attempts of resource 
reservation are onto the same resources at the same time. 

For the notification message, three signaling extensions are defined to 
modify error handling, enable expedited notification of failures and other 
events to nodes responsible for restoring failed LSPs. Any notification has to 
be requested and sent through the Notify Request object. The sending of a 
Notify message may be requested in both the upstream and downstream 
directions. 
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2.3 Generalized MPLS Architecture 



2.3.1 IVaffic Hierarchy Defined in Generalized MPLS 

The emergence of Generalized Multi-Protocol Label Switching 
(GMPLS) has opened a new era for the automatic control and management 
for the Optical Internet with a core technology of DWDM. GMPLS is 
extended from MPLS by IETF, which has collected numerous valuable ideas 
from both industry and academia on how to migrate an MPLS-based control 
plane to the optical domain. GMPLS can be described in brief as an optical 
extension to the MPLS-based control plane, which is in its on-going progress 
of standardization for supporting optical traffic with multi-granularity, and 
provides generalized labels to traffic flows with different switching types 
[4,5,6]. In the GMPLS architecture, labels in the forwarding plane of Label 
Switch Routers (LSRs) are not only limited to the packet headers and cell 
boundaries, but also time slots, wavelengths or physical ports. The 
arrangement of labels versus different types of interface is summarized as 
follows: for the packet-switched capable (PSC) interface, the label is a 20-bit 
number; for the layer-2 switch capable (L2SC) interface, the label is 
frame/cell headers; for the time-division-multiplexed capable (TDM) 
interface, the label could be the serial number of the time slot; for the 
wavelength-switched capable (LSC) interface, the label is the wavelength 
number of the lightpath; for the waveband-switched capable (WBSC) 
interface, the label is the ID of waveband in a local port; and for the fiber- 
switched capable (FSC) interface, the label is the local port ID of the fiber. 
The OXCs that support multi-granularity traffic are also called Multi- 
Granularity-OXCs (MG-OXCs). 

A connection can only be established between interfaces of the same 
type, which is termed a G-LSP in the context of GMPLS. Nodes in the 
optical networks equipped with GMPLS signaling are defined as G-LSRs. A 
hierarchy can be built if an interface is capable of multiplexing several G- 
LSPs with the same technology, which is called a “nesting of traffic flows”. 
In the same maimer, the nesting can also occur between G-LSRs with 
different interfaces. An interface hierarchy from the highest to the lowest 
level is as follows: FSC, WBSC, LSC, TDM, L2SC, and PSC, as illustrated 
in Fig. 2.5. G-LSP nesting is realized in such a way that a G-LSP that starts 
and ends on a lower level of interface is nested (together with the other G- 
LSPs of the same level) into a G-LSP that starts and ends on a higher level 
of interface. In the optical domain, there is no difference between the 
processing of LSC, PSC, L2SC and TDM traffic since all of them have to be 
de-multiplexed into wavelength-level flows to be further processed. Thus, 
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we must have at least three classes of granularities: lambda- (or wavelength), 
waveband-, and fiber-switching, in which several lambda-switching flows 
can be nested into a waveband-switching flow, and in turn several 
waveband-switching flows can be nested into a fiber-switching traffic flow. 




Fig. 2.5. Illustration of the traffic hierarchy defined in GMPLS. 

2.3.2 Functional Architecture of MG-OXCs 

The above description on the control and management of multi- 
granularity traffic has been documented in some recently reported Internet 
Drafts [4,5,6]. The realization of each hierarchy of traffic in the physical 
layer is well defined except for the waveband-switching. Three alternatives 
to implement waveband-switching in the optical domain have been reported, 
which are inverse multiplexing, wavelength concatenation, and physical 
switching of a waveband [5]. The former two methods are for bundling 
Lambda-LSPs (L-LSPs) into a logical waveband without a realization in the 
transport plane, and do not consider the fact that a waveband has a physical 
significance as well as a specific interface. We need a physical waveband 
which directly switches a spectrum of the wavelengths and is transparent to 
its inner wavelength individuals. 

Fig. 2.6 demonstrates the switch architecture of an MG-OXC for 
manipulating multi-granularity traffic, in which physical waveband- 
switching is realized. The fiber- and waveband-switching boxes at the left- 
hand side function as selectors on the input fibers and wavebands. The 
waveband- and fiber-switching boxes on the right-hand side perform space- 
switching on the traffic leaving the node. The input traffic takes either one of 
the three switching types in the optical domain according to the 
configuration in the switching boxes of selection. 

The upper control box is where signaling mechanisms are generated, in 
which the control packets can be transmitted either out-of-band or in-band. 
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For the out-of-band case, an IP network is built either on a single dedicated 
lightpath in one of the fibers, or on a single dedicated fiber bundled with the 
other fibers in the data plane. For the in-band case, the control packets are 
embedded in the data frame. The functional architecture of MG-OXC can be 
applied to each of the three transmission types for signaling mechanisms 
mentioned above. 



Input 

Fibers 




Output 

Fibers 



Wavdength 

mullipkxer 



Fig. 2.6. Architecture of a node that implements multi-granularity traffic 
tunneling. 

2.3.3 Switch Descriptions 

With MG-OXCS, the total size of switching fabrics is reduced and may 
cause less crosstalk [7]. We assume that a crossbar matrix approach is used 
in each switching box, and there are F;, F 2 and F 3 input fibers for fiber-, 
waveband- and lambda-switching. Let K wavelengths (or B wavebands) be 
contained in a fiber. The number of switching points (5F) for the MG-OXCs 
and Lambda switching OXCs are: 

SPSar=^-^"+^-(F, + F,f-B + {Fj-K 



n-pLambda a2 

^^CrossBar - ^ ^ 



Here we have A = F, + Fj + Fj . 




25 



If the Benes network (which achieves strict-sense non-blocking Cantor 
networks) approach is adopted, the number of switching points for both 
cases are [8]: 



SP. 

2 



Benes 

A 

2 



^ • (log, A) ■ (2 • logj A - 1) + B • • logj (F, + Fj ) • (2 • log, (Fj + F 3 ) - 1 



+ /s:-^-(log,F3)(2-log,F3-l) 



( 2 . 1 ) 



=^-|-aog2A)-(2-log,A-l) (2.2) 

We can see that the use of MG-OXCs can reduce the switch size in both 
crossbar matrix and Benes architectures if the percentage of each switching 
capability is well designed. For example, if F;, F 2 and F 3 is 10, 10, and 12, 
respectively, and if B and /sT is 8 and 64, respectively, then the number of 
switching points of an MG-OXCs and lambda-switching OXC is 17,160 and 
65,536 for the crossbar case, and 16,150 and 46,080 for the Benes network 
case. If the percentage of fiber and waveband-switching capability is 
increased, the number of switching points can be further reduced. 

The switching loss for a fiber with different switching types through an 
MG-OXC with the Benes network approach is formulated as follows [8]. 

pMG _ npMG y. r 

^lambda ~ '^^Benes ^ 

[2-(2.1og,A-l)-K2-(2.1og,(F,+F3)-l)+(2-log,(F3)-l)].L, 

C=sp„*xi's=2-[(2-log2A-l)+(2.1og2(F,-HF3)-l)]-L, 



^Fiber ~ fiber ' (2 ‘ 10g2 A l)]- Lg 

where Ls is the power loss through each switching point. 

In the above formulation, since the power loss in different switching 
types is different, the compensation for the switching power loss should be 
different for fibers with different switching types. On one hand, an 
insufficient compensation will undoubtedly impair the signal quality; on the 
other hand, an over-compensation may saturate power level along a fiber and 
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also degrade signal quality [9]. Therefore, for the MG-OXCs, regeneration 
should be performed intelligently on-demand according to a real-time 
configuration. The fiber switching boxes in Fig. 2.6 can be devised as shown 
in Fig. 2.7 for on-demand power loss compensation. The fibers experiencing 
lambda-, waveband- and fiber-switching will be compensated differently. By 
observing Equations (1) and (2) with Benes architecture, the increase of the 
I/O ports in fiber-switching boxes increases the total number of switching 
points only to a very limited extent (less than 10% in the example given). 




Fig. 2.1. k smart regeneration mechanism that can dynamically 
configure the amount of compensation to a specific fiber. 



The most notable advantage of using the 4-tier MG-OXCs is the cost 
reduction, which results from a reduction in the number of necessary 
lambda-switched devices including demultiplexers and switching fabrics. 
Another advantage of adopting the MG-OXCs is that a traffic hierarchy in 
the optical domain can be achieved, which facilitates the migration of the 
MPLS-based control plane to the optical domain. However, this is limited by 
wavelength continuity constraint due to the intrinsic difference between 
LSPs and L-LSPs. We have to consider whether an L-LSP is on a right 
wavelength plane before knowing whether it can be physically switched 
together with the other L-LSPs in the fiber- or waveband-tunnel. To 
guarantee the availability of transition between different switching tjqies 
(e.g., F-LSP L-LSP) at ingress and egress nodes of a tunnel, a partial 

wavelength conversion with a percentage of + ^‘^^ xl00% is 



necessary, where Fj, F 2 and F 3 are treated respectively as the switching 
capability for L-LSP, WB-LSP (Waveband-Switched LSP) and F-LSP 
(Fiber-Switched LSP) in the node. However, if the conversion capability is 
insufficient, an L-LSP may not be able to be bundled into a tunnel even if 
the tunnel has residual bandwidth. A similar situation happens when an L- 
LSP originally tunneled in an F-LSP/WB-LSP may not be able to 
demultiplex at the egress node of the tunnel to a specific wavelength plane. 
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The advantages of cost reduction and traffic hierarchy are gained at the 
expense of a decrease in throughput and an increase in the number of 
network states which are more difficult to track. The constraints on bundling 
lightpaths in the waveband- and fiber-switched fibers reduce the utilization 
of these fibers and in^air the performance. The computation complexity in 
path selection is increased as more network states need to be considered. 

2.4 Routing and Signaling Models 

This section presents the service and routing models defined in some 
IETF Drafts. In terms of interconnection services provided by networks, 
there are, in general, two models reported: the domain service model and the 
unified service model. In terms of routing functionality, there are three 
models reported: the overlay model, the integrated/augmented model, and 
the peer model. 

2.4.1 Service Models 

A service model of a network determines the signaling interfaces and the 
relationship between the user-network interface [3,10] (or UNI) and the 
network-network interface (or NNI). The service model of a network affects 
the signaling complexity at the IP-optical interface. There are two proposals 
of the service model for the next-generation optical Internet, which are the 
Domain Services model and Unified Services model. 

2.4.1.1 Domain Services Model 

This model was first seen in the Optical Domain Service Interconnection 
(ODSI) of the Optical Intercoimect Forum (OIF) [11]. The interconnection 
model is illustrated in Fig. 2.8. The user networks are not required to be 
MPLS-capable, while the UNIs will take over all the issues of provisioning 
lightpaths corresponding to the user requests. 

The border routers are in charge of the derivation of the reacheability of 
the other border routers, and the availability of wavelength channels in the 
optical domain. A virtual topology based on the demand of lightpaths (or G- 
LSPs) can be established upon the physical optical core circuits. 




28 




Fig. 2.8. The interconnection architecture for the Domain Service Model; 

TU is an abbreviation of Transportation Unit. 

A suite of standardized signaling primitives across the UNI are defined: 

1. Lightpath Creation: This service allows a lightpath with a specific 
attribute to be created between a pair of terminating points in the optical 
network. Lightpath creation may be subject to network-defined policies 
(e.g., connectivity restrictions) and security procedures. 

2. Lightpath Deletion: This service allows an existing lightpath to be 
deleted. 

3. Lightpath Modification: This service allows certain parameters of a 
lightpath to be modified. 

4. Lightpath Status Enquiry: This service allows status of certain 
parameters of a lightpath (referenced by its ID) to be queried by the 
router that created the lightpath. 

5. Client Registration: This allows a client to register its address and user 
group identifier onto the optical network. 

6. Client De-Registration: This allows a client to withdraw its address and 
user group identifier from the optical network. 

7. Link-State Query: This allows a client to supply another clients native 
address and user group ID, and get back an optical-network-administered 
address that can be used in lightpath setup messages. An end-system 
discovery procedure may be used over the UNI to verify local port 
connectivity between the optical and client devices, and allow each 
device to bootstrap the UNI control channel. 

8. Service Discovery Procedures: Service discovery allows a client to 
determine static parameters of interconnection with the optical network, 
including UNI signaling protocols supported, address resolution and 
neighbor discovery. Because a small set of well-defined services is 
offered across UNI, the burdens of signaling protocol within the optical 
domain (i.e., RSVP-TE or CR-LDP) are minimized. However, the 
signaling protocol is still required to convey a few messages with certain 
attributes point-to-point between the user router and the optical network. 
The optical domain services model provides an explicit control approach 

and interconnection method for IP and optical domains. However, it does not 
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consider the versatile types and natures of different routing protocols across 
the two domains. Furthermore, the integration of multiple optical sub- 
networks across NNI will require the specification of a standard routing 
protocol such as BGP. We may need a suite of integrated control 
mechanisms over the two domains. 

2.4.1.2 Unified Services Model 

This model mainly came from the MPLS-based control plane, in which 
the interconnection services are not specifically defined in UNI, instead, 
every node can communicate with the other nodes through standard 
signaling and routing protocols. Under this model, the LSRs in the electronic 
domain and the OXCs in the optical domain are treated as integrated 
network elements as shown in Fig. 2.9. Thus, from a routing and signaling 
point of view, there is no difference between UNI, NNI, and any other 
router-to-router interface. Under the unified service model, optical network 
services are obtained implicitly through end-to-end GMPLS signaling 
mechanisms. Specifically, an edge router can provide MPLS-based service 
as it creates or deletes G-LSPs. Therefore, the services obtained from the 
optical networks are similar to the domain services model. The only 
difference between the two models is that the domain service model defines 
UNI and NNI that yield a clean separation between domains with different 
administrations. On the other hand, the unified services model stretches the 
GMPLS signaling and OSPF routing instances into the user networks; 
therefore, the interconnection of domains with different administrations may 
be maintained more seamlessly. For instance, a remote router could compute 
an end-to-end path across an optical network utilizing OSPF-TE. It could 
then establish a G-LSP across the optical network using GMPLS signaling. 
But the edge routers must still recognize that a G-LSP across the optical 
network is a lightpath, or a conduit for multiple LSPs, in which the jobs 
originally designed for UNI and NNI are taken over by the edge routers. 




Fig. 2.9. Interconnection architecture for the Unified Services Model; TU 
is the abbreviation of Transportation Unit. 

The decisions as to when to create lightpaths, and the bandwidth 
management for these lightpaths, are identical in both the domain service 
model and the unified service model. 
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2.4.1.3 Routing Models 

From the viewpoint of routing (or path selection), three models have been 
proposed by the IETF Drafts: the overlay model, integrated/augmented 
model, and peer model [3]. 

The overlay model has the user domain (or IP domain) independent of 
the optical domain. IP domain acts as a client to the optical domain, which 
provides a service of point to point connection to the upper IP domain. The 
routing and signaling protocols in the IP domain are independent of those in 
the optical layer. The overlay model may be statically provisioned using a 
Network Management System (NMS), or may be dynamically provisioned 
through a distributed control process. Note that the static provisioning 
solution may not be scalable, which may incur an N-square problem in any 
expansion to the network size. The overlay model asserts that a suite of 
separate protocols should be developed for the transport layer (i.e., the 
Transport Unit (TUs)), which is similar to the case of the IP over ATM 
networks. The addressing, link-state dissemination and the way of topology 
discovery in the IP domain, are also independent from that in the optical 
domain. 

In the peer model the optical routers and optical switches act as peers. 
There is only one instance of a routing protocol miming in the optical 
domain and in the IP domain. Instead of the Border Gate Protocol (BGP), a 
common Interior Gateway Protocol (IGP), like OSPF or ISIS with optical 
extensions, is used to exchange topology information. The basic assumption 
in this model is that all the optical switches and the routers have a common 
addressing scheme, which is different from the case in the overlay model. 
For performing path selection under the peer model, an IGP like OSPF-OPT 
or ISIS-OPT is used to exchange topology information, in which the unique 
optical link parameters (such as per-wavelength availability) in the optical 
domain are captured. The OXCs and the routers maintain the same link-state 
database. The routers can then compute end-to-end lightpaths traversing 
through user and optical domains. 

The fully peered routing model obviously suffers from a scalability 
problem. The problem lies on the fact that all routers have to hold the 
dynamic link-state of both domains no matter in which domain the routers 
are located. ^ addition, the peer model has been questioned as to whether or 
not users need to be made aware of the topology and physical parameters of 
the optical core, which may include some confidential information. 

The third routing model is the integrated/augmented model, in which 
signaling and routing mechanisms between the user and optical domains are 
integrated. With the integrated/augmented model, the services provided in 
the optical core are embedded into the platform of the IP/MPLS service 
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entities. The model is fully integrated if the IP/MPLS layers act as peers of 
the optical transport network, such that a single routing/signaling instance 
runs across the IP/MPLS and optical domains. An IGP like OSPF-TE or 
ISIS-TE will be used to distribute topology information. This model assumes 
a common addressing space for both domains. 

The integrated/augmented model initiates a compromise between the 
fiilly integrated model and peer model. It allows separate routing instances 
running in the IP/MPLS and optical domains, while information from one 
routing instance is “leaked” into the routing instance of the other domain. 
For example, IP addresses could be assigned to the optical domain and 
carried by optical routing protocols so that the reacheability information is 
sharable between the two domains. From the viewpoint of control and 
management there is little difference between the two choices; however, the 
integrated model avoids the extra work of developing routing and signaling 
for the optical domain, which is preferable if no further administration or 
security problems are considered. 

2.5 Summary 

In this chapter, we introduced the architecture of MPLS-based control 
plane for the IP centric optical networks. The MPLS protocol was first 
presented, from which the Generalized MPLS was extended. The migration 
from the IP/MPLS control plane to the optical domain was discussed in 
detail. Control and management on multi-granularity OXCs was presented, 
which aims to reduce network costs. We also introduced the extensions to 
the existing routing and signaling protocols, including OSPF and RSVP, for 
a complete migration from the IP/MPLS to the optical domain. Finally, we 
introduced the routing and signaling models defined by IETF. 
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Questions 

1. What are the design principles which characterize the MPLS control 
plane? 

2. What are the main functional elements in the MPLS-based control plane? 
Describe the function mechanisms for each of the functional elements. 

3. Explain what label swapping and packet forwarding in the MPLS control 
plane are. 

4. Describe the format of the MPLS packet. What are the purposes of 
setting up a Forwarding Equivalent Class (FEC)? 

5. Describe the reason why people are interested in migrating MPLS control 
plane to the optical domain. 

6. Describe the analogies and heterogeneities between Label Switch Routers 
(LSRs) and Optical Cross Connects (OXCs). 

7. Lists two signaling protocols and two routing protocols for optical 
networks that are being standardized by Internet Engineering Task Force 
(IETF). Briefly describe their functions. 
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8. Suppose the MG-OXC has the following percentage for each switching 
type: Fiber-Switching: Fl/F, Waveband-Switching: F2/F, and Lambda- 
Switching: F3/F. The cost of the OXC is proportional to the number of 
switch points in the switching fabric (i.e., S units for having each switch 
point). We also assume each fiber contains K wavelength chaimels. 

(a) Derive the cost saving by using an MG-OXC and a Lambda- 
Switching OXC given that crossbar matrix approach is adopted. 

(b) Repeat (a) if the Benes Network approach is adopted in the switching 
fabrics. 

(c) Assume the MG-OXC is adopted in a network with the same 
percentage for each switching type all over the network. In the steady 
state, the number of lightpaths existing in each switching layer is Lls, 
Lwb. and Lpiber in average. Assume the setup of a lightpath yields 
revenue C units per day. It can be expected that the revenue in a 
network with only Lambda-Switching OXCs should be larger than in 
the case with only the MG-OXCs. How long does a pure Lambda- 
Switched network take to compensate the extra cost in the switching 
fabric compared with an MG-OXC network? Assume the link 
utilization between the Lambda-Switching layers of the two networks 
is the same. (Hint: use the results of part (a) and (b)). 

9. List the traffic hierarchy defined in Generalized MPLS. Which are the 
ones in the optical domain? List the traffic granularities from the coarsest 
to the finest. 

10. If the device shown in Fig. 2.7 is adopted, how many additional switch 
points does the modified Fiber-Switching Box need? Using the symbols 
provided in Q.8. 

11. When running in an MG-OXC network, since the power attenuation for 
the fibers dedicated to different switching t)^es is different, we need 
regenerators with different amount of power compensation for each of 
them. What is the difference in power attenuation between the fibers with 
each switching type? Assume that the crossbar matrix architecture is 
adopted (i.e., a single stage is experienced when a lightpath traverses 
through a switching box), each stage in a switching box attenuates S dB 
of power, and a multiplexer/demultiplexer attenuates M dB. 

12. Repeat Q.ll with the switching matrix changed to a Benes network 
approach. (Hint: the number of switch points in a switching box traversed 
by a lightpath is2 • logj A - 1, where A is the number of input ports to 
the switching box). 

13. What are the limitations upon the inspection of a packet header in the 
optical domain? 

14. What are the benefits of using multi-granularity OXCs (MG-OXCs) in 
optical networks? What are the tradeoffs for these benefits? 
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15. Write down two switch architectures to implement large scale optical 
networking mentioned in this chapter. 

16. Describe the service and routing models defined in GMPLS control 
plane. 




Chapter 3 



ROUTING AND WAVELENGTH 
ASSIGNMENT (RWA): OVERVIEW 



Routing and wavelength assignment (RWA) constitutes one of the most 
fundamental elements in the control and management of an optical network. 
This chapter provides an introduction to the design principles and objectives 
of the constraint of the RWA problem, and an overview of proposed 
schemes and approaches of the implementation and analytical modeling of 
the RWA process. The constraints imposed on a wavelength-routed optical 
network are outlined in Section 3.2, which includes topics of physical 
constraint and diversity constraint. Section 3.3 describes modeling of optical 
networks with and without wavelength conversion capability. The model 
incorporates networks containing partial wavelength convertible nodes with 
different conversion ranges. We also provide an algorithm for performing 
optimal routing and wavelength assignment in networks with nodes of 
partial wavelength conversion capability. The approach is based on a 
modified Dijkstra’s shortest path first algorithm, and can achieve a good 
computation complexity compared with some other reported schemes. 

Section 3.4 provides an overview of recently reported dynamic path 
selection schemes, in which fiilly-adaptive and partially-adaptive routing 
schemes are discussed. In addition, this section also presents recently 
reported dynamic wavelength assignment schemes. The link-state coherence 
becomes a problem in dynamic networks, by which a distributed 
computation for RWA can be performed in the source node (or termed 
source routing). Section 3.5 briefs RWA schemes with static traffic, which 
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contains an Integer Linear Programming formulations and heuristic 
approaches for solving the problem. 

3.1 Introduction 

Routing and wavelength assignment (RWA), also called path selection in 
the optical domain, is a key process of provisioning lightpaths in response to 
connection requests. For a wavelength-routed network, lightpaths are set up 
for connection requests between node pairs. A single lightpath occupies the 
same wavelength on all of the spans along its physical route under the 
wavelength continuity constraint. In the event that all-optical wavelength 
conversion is allowed in some or all network nodes, the wavelength 
continuity constraint does not always hold where a lightpath may take 
network resources on different wavelength planes. Regardless of whether the 
all-optical wavelength conversion ability is provided in each node or not, a 
physical route must be selected and a wavelength must be assigned to the 
lightpath. 

3.1.1 Static RWA Process 

We term an RWA process “static” if it is designed on the premise that the 
traffic distribution in a network will not be changed while the network is in 
operation. In the static case, a traffic matrix is given, which specifies the 
bandwidth demand between any node pair of the network. Then lightpaths 
are allocated to the network according to the traffic matrix. The design 
objective can be either to achieve a maximum throughput given the total 
network capacity (i.e., the number of fibers along a link and wavelength 
plane along a fiber) or to satisfy all the traffic demand with a least amount of 
fibers along each link or wavelengths contained in each fiber. In general, the 
static traffic design is usually required in the backbone networks that span 
across countries and continents, in which the setup or tear-down of 
lightpaths is not frequent (e.g., monthly, etc). The bandwidth provisioning 
along each lightpath is normally OC-192 (40Gbps) or OC-768 in the near 
future (160Gbps). Therefore, it is feasible for network management system 
(NMS) to optimize the throughput globally based on the entire traffic 
demand and upon all available network resources. With the premise that all 
the coimection requests defined in the traffic matrix need to be allocated, the 
optimization objective can be either to minimize the number of wavelengths 
per fiber if the number of fibers along a network span is fixed, or to 
minimize the number of fibers along a network span with a fixed number of 
wavelengths per fiber. An alternative is the combination of the above two, 
where the total lightpath mileage (or equivalently, the revenue generated) 




37 



that can be allocated into the network is maximized. Note that in this case, 
we usually assume that the number of fibers along a span and the wavelength 
planes per fiber are fixed. 

The static RWA problem can be performed through an optimization 
process such as Integer Linear Programming (ILP), which is notorious for an 
NP-complete computation complexity. Therefore, the optimization process is 
only feasible for small-sized networks (i.e., the number of nodes, spans in 
the network, and the number of wavelength channels along a single fiber are 
small). To reduce the computation complexity, the optimization process can 
be divided into two sub-processes, namely physical path selection and 
wavelength assignment, for deriving an approximate optimal deployment of 
lightpaths. Heuristic algorithms have also been devised for trading capacity 
efficiency with computation complexity. 

3.1.2 Dynamic RWA Process 

In addition to the static one, the other type of RWA problem is for a 
network with dynamic traffic. The type of RWA process is also called a 
dynamic RWA, where the computation complexity directly influences 
lightpath provisioning speed. The requirement for network dynamicity is 
mainly imposed by middle-sized networks such as the metropolitan area 
networks, in which the setup and tear-down of a lightpath is getting cheaper 
and with smaller granularity when compared with the Internet core networks. 
The requirement of dynamicity has also yielded a much different design 
objective from that of the Internet. In this case, the static RWA process 
cannot be applicable to dealing with traffic distribution that is changing from 
time to time. 

The dynamic RWA is aimed at satisfying lightpath setup requests one at 
a time with a goal of maximizing the probability of successful allocation for 
the subsequent demands. The focus of research is to ensure that traffic 
distribution is balanced, and that the segmentation of network resources is 
avoided as much as possible. The most commonly used performance index 
in the dynamic network is the blocking probability under specific potential 
traffic load for each S-D pair. The potential traffic load is defined as the 
average ratio of arrival and departure rates of connection requests between 
an S-D pair. The selection of lightpath(s) for a connection request can be 
formulated into a mathematical problem based on the current link-state, and 
is solved according to a custom-defined routing and wavelength assignment 
scheme. Any change of the traffic distribution has to update the link-state 
database. Based on the database, the next connection request can be 
allocated. 
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3.2 Constraints on Routing and Wavelength Assignment 

The routing constraints can be divided into three major categories: 
constraints imposed by the physical transmission medium, the constraints 
imposed by diversity requirements, and the wavelength continuity constraint. 
This section gives an overview of the three constraints, as well as of the 
design principles for RWA strategies in networks subject to the three 
constraints. 

3.2.1 Physical Constraints 

Physical constraints are ubiquitously imposed upon the transparency of 
optical networks, which may impair the availability of lightpaths. Network 
management is complicated by the transparency requirement in that there is 
no simple way to ensure that signals entering or leaving a node are valid. 
Most of the research on the routing and wavelength assignment is based on 
the assumption that all lightpaths have perfect signal quality, where network 
links are taken as “opaque” by adopting transponders so that regeneration, 
re-shaping, and re-timing (also known as the “3R”) are provided if 
necessary. The node equipped with transponders taps into the optical flow in 
a fiber, converts it to an electronic signal, and then transmits a fresh copy of 
the signal. Since a transponder is expensive, they may constrain the rapid 
development of new services, such as the use of the MG-OXCs and Photonic 
Cross Connects (PXCs). Therefore, it is necessary to define a “domain of 
transparency” [1, 2,3,4] to form an all-optical sub-network, in which the use 
of transponders can be eliminated. However, being limited by the 
transmission technology, all-optical networks without transponders can 
hardly be feasible for multi-hop data transportation under stringent physical 
constraints. Therefore, the design goal is to put a moderate number of 
transponders with their location well-designed so that a network operation 
with an optimal performance-to-cost ratio can be yielded. In terms of the 
path selection process, “domain of transparency” is defined for each 
potential source node, and is a sub-network in which the physical constraints 
can be totally relaxed when performing routing and wavelength assigiunent 
initiated at the node. Different transparent domains can be coimected through 
a border router, which provides traffic regeneration, access, and regular 
control signaling mechanisms. 

The task of defining a “domain of transparency” much depends on the 
physical apparatus and communication equipment adopted, and may be a 
group of network nodes and links. Note that nodes and links in a domain of 
transparency are not certainly close in physical distance if each link is 
subject to different extent of physical impairment. We can set up a whole 
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network to be a single domain of transparency instead of each sub-area 
network. However, in this case, not only more transponders are needed to 
compensate all possible long lightpaths, but the design is not scalable when 
the network is extended, requiring all equipment to be upgraded to support 
the new worst case path. Therefore, it is better to solve physical constraints 
through the efforts of routing and wavelength assigmnent schemes that can 
adapt to network size and dimension. An optimal or near-optimal 
deployment of transponders, along with a properly designed routing and 
wavelength assignment strategy, is essential for maximizing and 
economizing the network resource utilization. Since all the physical 
parameters have to be known during the design process, extensive 
knowledge of physical constraints is necessary. 

The physical impairments may distort optical signals in a linear or non- 
linear maimer as the optical flow traverses along a fiber or through a switch, 
which leads to a loss of optical signal quality. The loss of optical signal 
quality can be attributed to three major factors: Signal Power Loss, Noise, 
and Crosstalk. Signal Power Loss is wavelength-specific and linearly 
proportional to the distance of transmission in fiber optics. For example, 
signals carried on channels in the 1550 nm region have attenuation as low as 
0.2 dB per kilometer. However, with other wavelength ranges out of the 
1550 nm region the attenuation can be much larger. Another type of physical 
constraint that degrades the Signal to Noise Ratio (SNR) during transmission 
in fiber optics is the linear impairment, which includes Polarization Mode 
Dispersion (PMD), Amplifier Spontaneous Emission (ASE), and Chromatic 
Dispersion. The linear impairment is independent of signal power and 
wavelength, and can be modeled in clean mathematical forms for linear 
aggregation. Crosstalk belongs to a non-linear inqiairment that is mainly due 
to interference between different lightpaths as they traverse the same optical 
component, such as a directional coupler or a multiplexer/demultiplexer. In 
general, the non-linear impairments are hard to deal with explicitly in a 
routing algorithm because they lead to constraints that can couple routes 
together and lead to complex dependencies. For example, the order of fiber 
types through which a flow traverses affects the distortion. Another example 
is crosstalk between two lightpaths traversing the same directional coupler. 
If the two lightpaths are in the same wavelength plane, the crosstalk cannot 
be filtered out at the end node and as a result impairs the SNR. 

3.2.2 Diversity Constraints 

One of the major reasons for imposing the diversity constraint in path 
selection is for the purpose of achieving survivability. In the optical 
networking layer, two or more lightpaths are said to be diverse if they will 
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never be subject to a single failure at the same time. To determine whether 
two or more lightpaths are subject to a single failure, we have to define a 
hierarchy of Shared Risk Link Groups (SRLGs) (which will be further 
detailed in Chapter 6). 

The lightpaths flowing in a fiber can be treated in an SRLG because they 
share the same risk of being damaged by an accident such as a rodent bite. 
The fibers bundled together as a conduit are buried in a right-of-way (ROW), 
which is frequently obtained from railroads, pipeline companies, or sewage. 
An ROW can be in an SRLG, which shares the same risk of being cut by 
construction or a traffic accident. On the other hand, even two ROW’S that 
are far away from each other may be subject to a single natural catastrophe, 
such as an earthquake or a tornado. Therefore, the diversity constraints to the 
path selection process have to be specifically defined so that different types 
of single failure can be recognized. With this, a hierarchy of SRLG can be 
derived so that the diversity requirement can be simply modeled as a routing 
constraint used by path selection algorithms for detailed implementation. We 
wUl introduce the task of spare capacity allocation schemes in Chapter 7 and 
diverse routing and wavelength assignment in Chapter 8. A conduit that 
contains unidirectional fibers is defined as a link group subject to a single 
failure, because a conduit cut due to construction, traffic accident, or rodent 
bite, is the most commonly seen failure in Wide-Area and Metropolitan-Area 
networks. 

3.2.3 Wavelength Continuity Constraint 

The wavelength continuity constraint is unique to optical networks with 
WDM as the core technology, and is a consequence of multiplexing several 
wavelength channels into a single fiber. The wavelength constraint stipulates 
the usage of wavelength channels along a lightpath, which can be treated as 
a concatenation of these wavelength channels to form a loop-less simple 
path. Without wavelength converters or the use of O/E/0 conversion, the 
wavelength channels for a lightpath have to be on the same wavelength 
plane. When performing a path selection process, the above requirement 
restricts bandwidth utilization by increasing resource fragmentation, which 
behaves as the most critical performance impairment to an optical network. 

Due to the wavelength continuity constraint, the WDM networks are 
intrinsically different from the other coimection-oriented networks in terms 
of bandwidth allocation. The purpose of modeling a WDM network into a 
graph is to facilitate the use of any adaptive routing scheme, such as 
Dijkstra’s shortest path algorithm. According to the wavelength conversion 
capability of each node, a multi-wavelength network can be in any one of 
three categories: full-wavelength convertible, partial-wavelength convertible 
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and no-wavelength convertible [5]. A node is full-wavelength convertible if 
every input optical flow dan interchange its wavelength plane with another 
flow on a different wavelength plane. A node is no-wavelength convertible if 
the wavelength continuity constraint is always held. For the partial- 
wavelength convertible networks, only some nodes may have wavelength 
converters. A node with wavelength converters may be able to perform a full 
wavelength conversion or partial wavelength conversion. In the event of a 
partial-wavelength conversion, the node may either be able to interchange a 
fixed amount of lightpaths on different wavelength planes (termed the 
capacity of wavelength conversion) no matter what wavelength planes the 
lightpaths are located on, or only interchange specific groups of wavelengths 
(termed the range of wavelength conversion). For example, a lightpath on 
wavelength plane 1 (Wl) can only interchange with the lightpath on W3. 

A network can be full-wavelength convertible when all wavelength 
channels on different wavelength planes are exchangeable during the routing 
process. In this case, the network can be simply modeled as having 
Fx^T wavelength channels in each span, where F and K are the number of 
fibers along a span, and the number of wavelength channels along a fiber, 
respectively. If only a partial range or a partial capacity of wavelength 
conversion is provided, the wavelength conversion capacity should be taken 
as a kind of the network resources in each node that can be exhausted. Each 
node may not be equipped with the same wavelength conversion capacity 
and conversion range. Here we define the wavelength conversion capability 
of a node as the maximum number of lightpaths that can be converted in this 
node at the same time. 

3.3 Network Modeling 

This section discusses the modeling of networks with wavelength 
convertible nodes. Networks with fiill-wavelength conversion capability can 
take the wavelength channels along a fiber as a bulk of bandwidth instead of 
as independent wavelengths. The optimal routing in this case is simplified to 
a path selection problem in circuit-switched telephony networks. The no- 
wavelength convertible case can be solved by considering each wavelength 
plane separately. Optimal routing algorithms were proposed in [6,7], where a 
modified Dijkstra’s algorithm was performed upon a layered wavelength 
graph (WG) (which will be discussed later). Heuristic algorithms for 
performing RWA, such as SPREAD, PACK, Least-Loaded [8] etc, have 
been extensively proposed. We will further investigate these dynamic RWA 
schemes in Section 3.4. 

As for partial-wavelength convertible networks, in general, the modeling 
is more complex than that in the other two cases in that the distribution of 
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wavelength converters can be heterogeneous and with different conversion 
ranges in each node. Each node may not be equipped with the same 
wavelength conversion capacity and the conversion ranges. We need a more 
generalized modeling technique for the partial-wavelength convertible case. 

Fig. 3.1 (a) shows an example of a wavelength graph for a simple WDM 
mesh network with three wavelength planes along each fiber. The vertical 
connection between a pair of physically corresponding nodes (termed 
wavelength nodes in the following context) represents the conversion 
capability between two specific wavelength planes at the node. Any two 
wavelength nodes at the same physical location are isolated because the 
resource for wavelength conversion between the two wavelength planes at 
this node (or the base node) has been exhausted. The exhaustion of 
wavelength conversion capability between two wavelength planes can be 
because there is no wavelength converter available between these two 
wavelength planes at this moment. 

We term the WG illustrated in Fig. 3.1 as a simple WG due to its 
straightforward expression. However, adopting it for performing optimal 
routing incurs a serious problem: the cost of taking a wavelength converter is 
hard to define. The basic difficulty lies in that the conversion between 
different wavelength planes needs to be specified individually. Fig. 3.1(b) 
illustrates the enhancement to the simple WG for a partial-wavelength 
convertible node. The solid lines express the conversion capability between 
two separate wavelength planes. Extra links are added between every two 
wavelength nodes at a base node for modeUng the partial wavelength 
conversion capability. This scheme is also subject to some drawbacks: Not 
only does the creation of the layered WG become more complicated, but the 
number of network links is also increased by several times the scale of the 
original graph, which may add a significant extra cost to the link-state 
management. Therefore, some other modeling schemes have been proposed. 
We will introduce two of them in this section. 

3.3.1 Wavelength Graph with Extra Nodes 

An approach to modeling a network with partial wavelength conversion 
was proposed by [9], in which extra nodes are added into the layered WG to 
model the effect of wavelength conversion with specific conversion ranges, 
as shown in Fig. 3.2. Fig. 3.2(a) shows a simple network containing 6 nodes, 
of which only node A and B are equipped with wavelength conversion 
capability. In Fig. 3.2(b), node A can exchange the wavelengths of lightpaths 
between W1 and W2 by routing the lightpaths through node A“, while node 
B can exchange a full range of wavelengths through node B®. 
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Fig.3.1. (a) Simple wavelength graph for the WDM networks with partial 
wavelength converters; (b) Wavelength nodes at the same physical location; 
the solid lines express the wavelength conversion capability and ranges. 

With the WG modeling, the original network is reconfigured into a 22- 
node graph, which contains 18 wavelength nodes (i.e., the nodes in the 
wavelength graph that may be a source or a sink of traffic), 2 extra nodes 
(i.e., the nodes for modeling the wavelength conversion capability), and the 
source and termination nodes. Each link has its own link cost, which is not 
necessarily the same with each other. The link cost connecting a wavelength 
node to an extra node is also referred to as the cost imposed on the 
consumption of the wavelength conversion capability in that node. As a 
connection request arrives (e.g., from D to C), Dijkstra’s algorithm can be 
conducted on the WG. The relaxation of the termination nodes will bring the 
routing process to an end. Fig. 3.3 shows the link-state matrix, which 
specifies the link cost between each adjacent node. Between two adjacent 
wavelength nodes the link cost is fo- Between a wavelength node and an 
extra node, the link cost is fab- When performing a path selection process, 
Dijkstra’s shortest path first algoritlun is invoked upon the link-state 
capacity matrix. With the modeling technique and a well-designed cost 
function and link cost, the optimal lightpath can be derived. 

The modeling of a layered WG shown in Fig. 3.1 generalizes the cases of 
full-wavelength conversion and no wavelength conversion. In Fig. 3.4, node 
A has full wavelength conversion capability if and only if > G for 

all i, where i is the index of wavelength planes, so that node A can convert 
all the lightpaths from the same wavelength plane as the other wavelength 
planes at the same time. In general, node A (or any node) will be designed 
such that Za = /a< “ /a which means that the conversion 

capability to and from different wavelength planes is the same. 
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Fig. 3.2. An example of layered WG with (a) the original topology; and 
(b) the layered WG with three wavelength planes. 



With the above modeling, however, the network is enlarged not only K 
times, where K is the number of wavelength planes in the network, but an 
extra amount of nodes have to be added into the network for the wavelength 
convertible nodes. Therefore, the total complexity of performing the 
Dijkstra’s algorithm is Oi(K-N + EN)-iN + l))«0{iK-N + EN)-N) , 
where N is the number of nodes in the original topology, and EN is the 
number of extra nodes added due to wavelength conversion. The term 
K-N + NE in the expression of complexity stands for the upper bound on 
the number of nodes that need to be relaxed. The second term N + 1 is the 
upper bound on the number of nodes that have to be inspected in each 
iteration, because a node can only be adjacent to at most + 1 nodes (i.e., N 
nodes in different physical locations on the same wavelength plane, plus an 



extra node). Note that E has an upper bound 




. N , which happens when 



every pair of convertible wavelength planes is isolated with each other, as 
shown in Fig. 3.5. 
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Fig. 3.3. The capacity matrix F for the layered wavelength graph shown 

in Fig. 3.2 



AO 




Fig. 3.4. An illustration of a network node with different wavelength 
conversion capability. 
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AOi A“ A<XW/2) 



A‘ A^ A^ a'"I-’ a'*' 

Fig. 3.5. The worst case of the layered graph. Node A is modeled with 
almost 50% more nodes than the no-wavelength convertible case. AOi is the 
ith converter node for node A. 

3 J.2 Simple WG mth Modified Dykstra’s Algorithm 

We reduce the computation complexity of the above scheme by adopting a 
modified Dijkstra’s algorithm, called Reserve-Converter-First (RCF), in 
which no extra node is needed in the wavelength graph except for the source 
and destination nodes. The computation complexity of using this approach 
reduces the Dijkstra’s algorithm in the extra-node method from 
0(iK-N + E)-N) toOiiK/J + N)-i{K-J)-N)), where J is the average 
number of wavelength nodes at the same physical position where the input 
per-wavelength flows can be converted each other. The value of J can be 
ranged from A - 1 (for full wavelength convertible networks) to 2 (the 
example shown in Fig. 3.5. 

First, let us briefly recapitulate Dijkstra’s original algorithm. Nodes in the 
graph are categorized into two groups: relaxed (or marked) nodes and un- 
relaxed (or unmarked) nodes. Each relaxed node is marked with the shortest 
path and its distance to the source. The iterative process relaxes an un- 
marked node with the shortest distance to a marked node. Once a node is 
relaxed, it is then marked with the distance and the shortest paths to the 
source. The iteration stops if the destination node is relaxed, or if no 
unmarked node with a finite distance can be found. 

The basic idea of this approach is to relax all the nodes that are 
interchangeable with a wavelength converter during the relaxation process. 
For example, in the layered wavelength graph of Fig. 3.6, the relaxation of 
A‘ by D‘ will also relax A^, so that node A' and A^ are set to have an equal 
distance to D*. However, when we proceed to relax any un-relaxed 
neighbors of A^(e.g., B^), the algorithm will check if D^ (i.e., the upstream 
node of node A along the shortest path registered by node A*) and (i.e., 
the wavelength node to be relaxed) are on the same wavelength plane, or are 
of the same color. If they are not, an extra cost is imposed when B^ is 
relaxed by A^. This is equivalent to imposing an extra cost when setting up a 
lightpath through D‘-A^-A^-B^, in which the wavelength conversion 
capability is consumed at node A. To avoid looping, in this case node A^ 
will not considered by a shortest path containing A* and A^. 
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Fig. 3.6. The wavelength graph without any extra node. 

The optimal routing algorithm with a WG graph without any extra node 
added is summarized as follows. 



Input: Soiurce and terminate nodes (S,D); 

1. Derive G(Q,I»'P) , the wavelength graph containing wavelength node 
set (which includes wavelength nodes, extra nodes, source and 
terminate nodes in this case) and link set L with different wavelength 
planes VTe 'P . N„ C £2 is a group of nodes on wavelength plane W. 
Nodes on different wavelength planes are in a family if they are at the 
same physical location. 

2. /* Initiate the network state. */ 

For (each node u in £2 ) Do 

markediu) := false; dist(u) := °° ; SP(u) := null; 

/* SP(u) is the shortest path from m to 5 with total cost of dist(u); */ 

End For 

marked{S) := true; dist{S) := 0; 

3. Select wavelength node v s.t. markedly) == false, v is on wavelength 
plane w, and costly ^)+dist{x) is minimum for all XE or all x at the 

same base node as v, where markedlx) == true and no family of v is on 
SPlx). 

If Icostlv^) == oo ) Stop; End If; 

/* no path can be found; blocking is announced; 

In the above, cost(v,x) := 
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if V and last_SP(x) e N^, VwgW 

• Cjj „ +C^ if V e N^, last_SP(x) e w’, w w’ 

oo if ;t and v are not adjacent 

I 

/* last_SP(x) denotes the wavelength node along SP(x) next to node x\ Cx,v 
is the link cost between node x and v; Cx is a constant for the consumption 
of wavelength conversion on node x; */ 

4. dist(y) : = dist(x) + co 5 t(x,v); 
markediy) := true; 

SP(v):=SP(x)0(x-v) 

/* the shortest path from S to v is the shortest path of x concatenating with 
the wavelength channel (jc-v); note that x and v must be on the same 
wavelength plane so that v can be a neighbor of x. */ 

5. If( v^D)Then 

For (all v’ wavelength-convertible with v) 
markediy’) : = true; distiy’) : = dist(v); 

5P(v’):=SP(v)0(v-v’); 

End For 
Groto Step 3 

Else Return SP(y) and dist(v); Stop. 

End If; 



The above method can only achieve an approximating optimal since we 
relax all the wavelength nodes in the same physical position. The 
computation complexity is 0({K/J +N)-((K-J)-N)) due to the special 

structure of the wavelength graph. The term “K/J + N” comes from the 
fact that we inspect at most K/J + N different wavelength nodes at the 
same physical position in each iteration. The use of the term “{K-J)-N is 
because there are at most (K-J)-N wavelength nodes to be relaxed in the 
entire network before the destination is relaxed. The following section 
introduces an approach for optimal routing with a computation complexity 
0((K+N)-{K-N)). 
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3.3.3 Shortest Path Algorithm with Wavelength Graph (SPAW G) 

This section provides another scheme for performing optimal routing 
with wavelength graph. Unlike the approaches introduced above, the 
wavelength graph in SPAWG [13] has a dedicated link coimecting any pair 
of two wavelength nodes if a wavelength converter is provided at the same 
physical position. Although the number of links is largely increased 
compared with the previous schemes, the scheme can guarantee the 
optimality of the derived lightpath since a single wavelength node is relaxed 
in each iteration. The example in Fig. 3.6 is redrawn in Fig. 3.7 using the 
new wavelength graph technique. 




Fig. 3.7. The wavelength graph technique demonstrated in [75]. 



Due to the special structure of the wavelength graph, the computation 
complexity for deriving a lightpath in the wavelength graph using Dijkstra’s 
algorithm is 0{{K + iV) • ( A" • N)) rather than the path selection process using 
conventional Dijkstra’s algorithm performed in a network with K-N nodes. 
In each iteration of relaxation, the algorithm needs only to inspect K+N 
nodes instead of K-N nodes, since a wavelength node can only be adjacent 
to the other wavelength nodes either on the same wavelength plane (where 
there are N wavelength nodes) or at the same physical location (where there 
are K wavelength nodes). 

3.4 Dynamic Path Selection 

Section 3.3 discussed WG modeling and approaches for optimal routing. 
This section will provide an overview of the dynamic path selection which 
can either be fully-adaptive or partially-adaptive . Dynamic path selection 
can be performed in a centralized manner, in which all the computation tasks 
are taken by the Network Management System (NMS). Although the 
problem of database inconsistency among each node can never occur with 
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centralized control, the NMS has to deal with network events (i.e., requests 
for connection setup or tear-down) one after the other, and can easily incur a 
scalability and reliability problems. In general, centralized control should be 
avoided for the reduction of a time-bottleneck, and the computation efforts 
can be decentralized on all the network nodes to achieve a better robustness 
and scalability. Each node disseminates its link-state periodically to all the 
other nodes according the OSPF/IS-IS protocol, in order to maintain the 
coherence of link-state and traffic engineering databases (see Fig. 2.3). An 
accurate and effective path selection is based on accurate link-state 
dissemination, by which the path selection process is transformed into a 
mathematical formula for finding the most cost-effective paths. To improve 
scalability and robustness, a distributed control is desired, which is detailed 
in the following paragraphs. 

3.4.1 Adaptive Routing 

Due to the wavelength continuity constraint, most of the existing studies 
use either Fixed or Alternate Routing schemes (which will be introduced in 
the next section) to select a physical route for each connection call. 
However, the Fixed or Fixed Alternate Routing have pre-defined alternate 
paths for each node pair, which are totally static to any traffic variation and 
may largely restrict the performance behavior. To achieve better 
performance and adaptability to traffic variation, a fully-adaptive approach is 
desired. Fully-adaptive routing can achieve good performance in finding the 
shortest paths in networks based on the dynamic link-state and a custom- 
designed cost function. The most commonly seen adaptive routing scheme 
uses Dijkstra’s shortest path first algorithm, which consumes a reasonable 
computation complexity, O(iV^), where AT is the number of network nodes. 
To perform fully-adaptive routing in a full-wavelength convertible network, 
the problem is degraded to a routing process in a telephony network. The 
wavelength chaimels along a fiber are interchangeable and can be treated as 
a bulk of bandwidth. However, in the event that networks have no 
wavelength conversion capability or are only partially-wavelength 
convertible, the use of the layered WG approach is necessary before 
Dijkstra’s algorithm can be applied. 

A fully-adaptive routing can turn out to be an optimal routing if the 
selected lightpath is optimal (or the route with the lowest cost from source to 
destination in the network). In general, an optimal routing problem can be 
solved by using any adaptive schemes. The layered graph introduced above 
is an approach to solving the optimal routing problem in all types of 
networks. However, it mainly suffers from the following three problems. 
First, the stale link-state may result in a conflict of network resources during 
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the resource reservation between two LSPs. In the optical layer, this problem 
can be solved by the holding/preemption parameters in the path setup 
message specified in RSVP and CR-LDP. However, provisioning 
dynamicity may be impaired, which degrades the quality of service. 

The second problem is the scalability in the link-state dissemination. The 
flooding of the link-state from every node to the whole network may congest 
the control plane and disturb the other signaling mechanisms, especially 
when the flooding cannot be fully asynchronous. In addition, the periodic 
flooding is also the reason for the stale link-state mentioned above. The third 
problem is that the fully-adaptive scheme can hardly take the path-based 
metrics into consideration, such as the potential bottleneck between two 
nodes (which will be detailed in the latter paragraphs), or the correlation of 
the Shared Risk Link Group (SRLG) constraint (which will be detailed in 
Chapter 7). 

The other type of adaptive routing is called partially-adaptive routing, 
which includes Fixed Routing (FR) and Fixed Alternate Routing (FAR). FR 
and FAR are distinguished by the number of pre-scheduled paths for each S- 
D pair (termed alternate paths, and abbreviated as APs in the following 
context). In general, FR equips each S-D pair with a single physical path for 
path selection. With FAR, nodes are usually equipped with two or more APs. 
For both schemes, each ingress node is equipped with a routing table, in 
which the alternate paths to all the egress nodes are defined at the network 
planning stage (or before the network is running). As a coimection request 
arrives at a node pair, the source looks up the routing table and selects a 
lightpath along one of the APs. 

Since a single path is present for each S-D pair with FR, the RWA 
problem is degraded to a task of wavelength assignment along the fixed path, 
which leads to an ultra-fast routing decision-making process at the expense 
of performance. To operate networks under the architecture of FAR, the 
ingress node selects the best path among those APs according to a pre- 
defined cost function and a dynamic link-state. The dynamic link-state can 
be gathered in the following three ways. First, the ingress node sends a 
probing packet along each AP to the egress after a coimection request 
arrives, and the egress coordinates the probing packets and begins the 
resource reservation along the selected path. Second, periodic probing is 
performed by each source node to all its destination nodes for gathering the 
link-state of the APs defined in the routing table of the ingress. Third, every 
node periodically uses flooding, or minimum spanning tree architecture [10], 
to disseminate dynamic link-state of some corresponding network resources 
to all the ingress nodes. The three methods have their problems, respectively. 
The first approach incurs an extra signaling latency for performing the 
probing mechanism, which damages the LSP setup dynamics. On the other 
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hand, the periodic probing or dissemination may suffer from a stale link- 
state problem, which impairs performance and dynamics if a conflict on 
resource reservation occurs. 

The FAR is more suitable to middle-sized networks than large-sized ones 
for the following reasons. First, a large-sized network may spend a 
significant amount of time on probing each APs after the arrival of a 
connection request. Second, the size of a probing packet - where all the per- 
wavelength link-state along an alternate path is recorded - should be limited, 
so that the probability of a successful probing without any dropping due to a 
transmission error can be kept high. As a result, a large-sized network, in 
which a round-trip-delay and the total node processing time may take a few 
seconds during a probing process, caimot cooperate well with the FAR 
architecture. In addition, path-based link metrics can be easily provided in 
FAR. 

Intrinsically, the FAR has a couple of disadvantages that need to be 
improved. First, the performance behavior is strongly determined by the 
alternate paths arranged for each node-pair, which may 5 deld a load- 
balancing problem if the alternate paths are not under a proper design. 
Second, since the alternate paths are fixed and cannot adapt to the traffic 
variation, the performance is impaired. In Chapter 4 an effective algorithm 
will be introduced to deal with this problem, which also solves the load 
balancing and adaptation problems of FAR. Third, the routing table in each 
node grows in size along with the network, and needs to be updated 
whenever the network topology is changed. 

Partially-adaptive routing can be performed in a fully distributed manner, 
such as hop-by-hop routing or deflection routing. In this case, instead of 
keeping APs for all the other nodes in the network, every node keeps a 
routing table specifying the mapping between the output port and the 
destination. The content of the routing table can be dynamically updated 
according to the disseminated link-state. The above scheme is also called an 
alternate link routing due to the fact that the mapping is not only limited to 
one-to-one, but also can be many-to-one [1 1,12]. 

The dynamic path selection can be performed in either a fully-adaptive 
way in which the network-wide dynamic link-states are necessary for 
making a routing decision, or a partially-adaptive manner in which the 
routing decisions are made according to a limited amount of d)mamic per- 
wavelength link-states. The partially-adaptive routing scheme can be the 
Fixed Alternate Routing where RWA is initiated at the destination node, or 
can also be deflection or hop-by-hop routing in which routing decisions are 
made by each node along the selected path. Although all the dynamic routing 
schemes have their own design originality, the hop-by-hop and deflection 
routing schemes, which only consider local link-state information without a 
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global concern, basically need more elaboration, and may incur fatal 
problems when some routing constraints are imported. In Chapter 4, we will 
present a couple of algorithms of RWA based on the architecture of the 
Fixed Alternate Routing under the requirements of load-balancing, multi- 
granularity, or criticality avoidance, to achieve versatile design objectives. 

3.4.2 Dynamic Wavelength Assignment 

Based on the physical route determined by the approaches provided in 
Section 3.3, the RWA algorithm needs to select a wavelength plane along 
the physical route for allocating a lightpath corresponding to the coimection 
demand. The following is an overview of the proposed dynamic wavelength 
assigmnent schemes. 

The dynamic RWA problems in the multi-fiber networks have also been 
extensively studied in the past ten years. Due to its extremely high 
computation complexity, the task of dynamic RWA is separated into two 
independent sub-tasks - routing and wavelength assignment. In this case, no 
matter a fully- or partially-adaptive routing strategy is adopted, a physical 
path is selected first, and then a proper wavelength plane is assigned along 
the physical path for deriving the best lightpath. For achieving better 
performance, dynamic link-state (e.g., per-wavelength link-state) can be 
taken into consideration while selecting a physical path, so that the 
wavelength assignment is conducted together with the physical path 
selection. Using the WG technique is an approach that can perform both 
routing and wavelength assignment at the same stage in a fully adaptive 
manner. However, it is also subject to a higher conq)utation complexity and 
needs to have full knowledge of network-wide per-wavelength link-state. If 
we adopt the FAR scheme, the above disadvantages can be avoided without 
losing much performance behavior. As we have mentioned in the previous 
section, a lightpath is selected from the candidate physical path (i.e., APs) 
according to the dynamic link-state along those physical paths. 

The proposals of wavelength assignment that cooperate with FAR are 
listed as follows. 

1. Random Fit (RF): first derives all the feasible wavelength planes along 
the pre-determined physical route. The wavelength plane for the selected 
lightpath is randomly chosen from the eligible ones. 

2. First Fit (FF): selects a lightpath among all the eligible ones in a fixed 
order, (e.g., the lightpath with the smallest index is chosen). Compared 
with the RF algorithm, FF is lower in computation complexity because it 
is not necessary to search all wavelength planes before a lightpath is 
determined. The idea behind this scheme is to pack all the in-use 
wavelength channels in order to avoid network resource fragmentation. 
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This scheme performs well in blocking probability and fairness relative 
to its computation complexity, and is preferred in practice. 

3. Most-Used (MU): uses a pre-defined cost function and standardized 
dynamic link-state metric (e.g., the maximum reservable bandwidth on 
the wavelength plane) to select a wavelength plane. The wavelength 
plane of a link with less bandwidth has a lower cost. 

4. Least-Loaded Routing (LLR): chooses the kth wavelength with the 
minimum index in Sp that follows 

max min A . , 

keS, lep 

where A^ is the index of fibers (or optical connections) for which the 

kth wavelength is utilized on link /. Sp is the set of available 

wavelengths along the shortest path p. 

5. Minimum Sum (MS); chooses the kth wavelength in Sp that follows 



Aj 

mm- ’ 



keS„ 



Ft 



where is the number of fibers on link /. 



6. Adaptive Dynamic Routing with x pre-calculated paths (ADRx): selects a 
lightpath from x pre-defined physical routes using the LLR algorithm 
defined in (4). Try dynamic routing if all the efforts fail. 

7. Fixed-Path Least-Congested (FPLC): selects the wavelength plane with 
the widest bottleneck along the physical route. Note that the system has 
to be multi-fiber, so that a wavelength plane can accommodate multiple 
wavelength channels at a time. Otherwise, FPLC is degraded to FF when 
the network is single-fiber. FPLC outperforms MU and FF in terms of 
blocking probability in a multi-fiber network. 

8. Maximum Sum; the total capacity (i.e., the number of total lightpaths that 
can be set up in the network by all S-D pairs) reduced by allocating a 
lightpath is required to be the minimum. 

A blocking occurs if aU the pre-defined physical routes contain no free 
lightpath to satisfy the connection request. Wavelength conversion capability 
can reduce the blocking probability. However, it has decreasing effects as 
the size of the networks increases [8]. Also according to [8], the blocking 
probability is decreasing in the proposals (1) ~ (5). The first scheme that 
randomly chooses a lightpath has a largest blocking probability, while (5) 
yields a minimum blocking probability among (1) to (5). Instead of defining 
only a single physical route, schemes (6) and (7) define two and more 
physical routes to further decrease the blocking probability, which have been 
reported by simulations. The proposal (8) investigates the mutual 
interference between lightpaths of different S-D pairs, and can achieve the 
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best performance. However, it needs global per-flow link-state and huge 
amount of dissemination if distributed control is desired. 

Each of the above wavelength assignment schemes has its original design 
objective. In the first two schemes (i.e., RF and FF), a wavelength plane is 
selected without regard to the dynamic traffic distribution. These schemes 
can be suited in a network that needs ultra high speed lightpath provisioning 
in order to cope with the rapid-changing dynamic traffic. The MU scheme is 
used to reduce the resource fragmentation by packing traffic into the most 
loaded wavelength plane, while the LU scheme is used to achieve load- 
balancing in the network physical topology. FPLC is aimed at allocating a 
lightpath with the widest bottleneck in capacity. Schemes (3) ~ (7) take 
dynamic network link-state into consideration when selecting a lightpath, 
and can reflect the traffic distribution in the network. With scheme (8), an S- 
D pair needs not only the link-state (i.e., wavelength availability) along the 
alternate paths, but also the global per-flow link-state (i.e., the information of 
the lightpaths on the other S-D pairs), to make a routing decision. 

3.43 Link-State Coherence 

We assume that no routing decision is made during the normal operation 
in a network with static traffic. Therefore, centralized control is the most 
suitable strategy, where the NMS handles all the computation efforts of 
RWA at the planning stage such that the problem of link-state coherence 
does not exist. In the dynamic networks, since lightpaths are set up and tom 
down all the time, the centralized control can hardly be feasible for its 
scalability and robustness problems as discussed. The robustness problem is 
due to the fact that the failure of the NMS may shut down the whole 
network. The scalability problem is incurred by a sequential route-searching 
process for the bursty arrival of connection requests. This problem is made 
much worse when the network is heavily loaded. Therefore, a distributed 
control such as the source routing defined in OSPF, is solicited in a dynamic 
network environment. Link-state coherence among each network node 
becomes an issue in this case. This section is devoted to the dynamic link- 
state correlation problem. 

With source routing in a dynamic network, routing decisions are made by 
the source node in a distributed way according to the disseminated link-state 
information fi'om the other nodes. The sacrifice made for this distributed 
scheme is that the link states of the whole network have to be disseminated 
periodically (e.g., 30 seconds a time defined in OSPF). With this, the stale 
link-state problem may occur due to the fact that two or more connection 
requests reserve the same network resources (e.g., a wavelength channel). 
The s)mchronous/asynchronous flooding of link-state packets may congest 
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the network control plane during the dissemination period. To improve this 
situation, the design can make use of the difference between an IP or MPLS 
network and an optical wavelength-routed network. 

A packet-switched MPLS network may have more than 100,000 LSPs 
co-existing at any moment. However, in the network optical layer, the 
number of lightpaths is much smaller. Based on the fact, the setup/tear-down 
of a lightpath (called a network event) in the network optical layer should be 
much less frequent than that of the LSPs in the MPLS packet-switched 
networks. In other words, the traffic variation in optical networks is more 
“discrete” than that of the packet-switching networks. The time period 
between two network events is inversely proportional to the traffic 
dynamics, during which some planning efforts can be made to increase the 
probabiUty of successfully setting up the subsequent connection request. We 
define the planning efforts made during any two arrivals of network events 
as “inter-arrived planning”. 

During inter-arrival planning, the first and most important issue is the 
link-state coherence among the databases of all network nodes. To improve 
the drawbacks seen in traditional OSPF, “Link-State On-demand” is defined 
as a dissemination approach such that the dynamic link-states are broadcast 
by a node whenever there is an update on its corresponding network 
resources (e.g., the availability of an output port on a wavelength plane). 
However, the link-state on-demand scheme is hardly feasible if the arrival of 
a network event is highly frequent except that each node is defined with a 
threshold upon link-state variation above which the link-state update 
dissemination is initiated. Due to the discreteness in the optical domain, we 
can define the threshold as any arrival of a network event. Therefore, the 
path selection process in the optical layer can be totally free from the stale 
link-state problem. 

To further reduce the impact of overhead in performing link-state 
coherence in large-sized networks, the whole network can be divided into 
several sub-domains, in which a sub-network management system (Sub- 
NMS) is devised for each sub-domain as shown in Fig. 3.8. Because the 
number of lightpaths in large networks can be much larger than that of 
smaller ones, and because the signaling latency between two nodes may also 
be large, the feasibility of correlating link-state between different network 
nodes as a part of inter-arrival planning is getting lower. As a result, the fully 
distributed control with the probing technique and the link-state on-demand 
discussed above may not be suitable. The Unk-state dissemination process in 
the Internet has been well-defined in the OSPF and BGP, which supports the 
sub-grouping of network nodes into several sub-areas according to their 
physical locations. Sub-NMS and border routers in each sub-domain provide 
reachability information to the nodes in the same sub-domain when a path 
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selection process is conducted in the nodes. The reachability information 
defines the connectivity between two nodes in different sub-domains. The 
coherence among the databases of Sub-NMS’s or border routers in different 
sub-domains can be conducted in a distributed way, usually done through an 
approach of Duplicated Databases (DB) [13]. 

The setup of a Sub-NMS in each sub-domain is intended to establish a 
compromise between the centralized control architecture and the fully 
distributed control scheme. Instead of performing any computation for 
routing, each sub-NMS behaves as a link-state database, through which the 
nodes in the sub-area ask for the network-wide link-state information and 
reachability on-demand. 




Fig. 3.8. The network domain with the partially centralized distributed 
database system. 

The advantages of this configuration are summarized below. 

1. The amount of the disseminated link state can be reduced from o(n^ )to 
0(n), where n is the number of nodes. 

2. The chance of conflict while setting up a lightpath is reduced since 
coordination time is reduced. 

3. The robusmess is maintained. If a sub-NMS fails, the nodes in the 
corresponding sub-domain are coordinated by the sub-NMS in the 
neighbor sub-area. 

4. The scalability is maintained. A sub-NMS keeps database coherent with 
the other sub-NMSs and supplies link states to the nodes in the sub- 
domain, which can be done with ease. 

3.5 Static Routing and Wavelength Assignment 

This section gives a short overview of the static RWA problem. The 
optimization techniques that can be adopted in solving the static RWA 
problem are an important basis for performing a survivable routing and spare 
capacity allocation, which will be introduced in Chapter 7. 
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With the given traffic matrix that specifies the number of lightpaths (or 
amount of bandwidth) required between each S-D pair, the optimal design of 
allocating lightpaths for satisfying all the connection requests can be 
achieved provided that sufficient computation time is given through an 
optimization approach, such as Integer Linear Programming or Flow 
techniques. However, the optimization process is subject to an NP-complete 
computation complexity in most of the situations. As the network size and 
the number of lightpaths co-existing in the network are growing, these 
optimization algorithms may need an intolerably long computation time to 
complete. Therefore, heuristic algorithms are necessary for a compromise 
between performance and computation complexity. 

3.5.1 RWA by Integer Linear Programming 

It is clear that the multi-commodity flow problem with the traffic 
corresponding to each S-D pair viewed as a different commodity. A 
mathematical similarity is known as the minimum cost network flow 
problem, in which each S-D pair can yield several commodities to transport. 
Since it makes no difference to each commodity across an S-D pair when we 
consider the allocation of lightpaths to satisfy the traffic requests, the 
solution to the minimum cost network flow simply fits the requirement of a 
corresponding static RWA problem. We formulate the problem as follows 
with an optimization objective of minimizing the number of fibers required 
along each network span. 

Assume there are F fibers along a directional link, and there are K 
wavelength channels in a fiber. Let NC^’^ denote the number of connection 

requests from node 5 to D on wavelength plane k. Let denote the 

number of lightpaths from node S to D traversing through link a-b using 
wavelength k, where can be 0, 1, ..., F. Assume we allow two or 

more lightpaths to be set up between an S-D pair. We are given a traffic 
matrix TR in which TRs,d denotes the number of lightpaths needed between 
node S and D. The optimization objective is to: 

Minimize: F 



Subject to the following constraints: 






s,d 



(3.1) 
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for all adjacent node pairs {a, b) and wavelength planes k 






-NCl’° if5==fe 
■ NCl'^ \iD = b 
0 otherwise 



(3.2) 



Y,NCI^ . (3.3) 

k 



where =0,1,2, (3.4) 

Eq. (3.1) is the constraint for the total number of wavelength channels on 
the same wavelength plane to be limited within F. Eq. (3.2) quantifies the 
number of lightpaths accessed, terminated, and traversing at a specific node, 
respectively. Eq. (3.3) is the constraint imposed by the traffic matrix. Eq. 
(3.4) imposes an upper bound on the number of lightpaths traversing a 
network span. 

The same approach may also be applied to obtain the minimum number 
of wavelength planes given the traffic matrix and the number of fibers along 
each span NF. For a given number of wavelength planes W, and the traffic 
matrix Af, any Integer Linear Programming (ILP) solver can be applied to 
find the solution, which is the minimum number of fibers required. 

The alternate problem can be the maximization of the number of 
lightpaths (NL) set up in the network with ILP for a fixed number of 
wavelength planes and a given traffic matrix. 



No_SD 

Maximize: NL= ^nii 
1=1 



Subject to the following constraints: 



m^>0, where i = 1, 2, ..., No_SD 
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EC- a\<F forall(ifc,/) 



for i = 1, 2 , ..., Ns,d 

k p 



nil - ^ = 1 > 2 , Ns,d 

where NC is the total number of connection requests to be settled, qt is the 
fraction of the load which arrives for S-D pair i; No_SD is the number of S- 

D pairs; nii is the number of coimections established for S-D pair i; Sp is a 
binary index which yields a “1” if path p is between S-D pair i, and Sp=0 
otherwise; (Tp is a binary index which yields a “1” if link I is on path p, and 
<Tp = 0 otherwise; is a binary index which yields a “1” if wavelength 
plane k is assigned to path p, and = 0 otherwise. 

3.5.2 RWA with Heuristic Algorithms 

The static RWA problem can also be solved through heuristics instead of 
a time-consuming ILP optimization process. In general, the difference 
between a heuristic and the ILP optimization lies in the way the connection 
requests are inspected and tested. With heuristics, connection requests for 
each S-D pair are allocated into networks one by one, or group by group, 
instead of an optimal deployment through a global search. Based on the 
traffic demand between each node-pair, the number of lightpaths required 
between each node-pair can be determined. With the number of lightpaths 
required between each S-D pair, we allocate lightpaths sequentially into the 
network for each S-D pair by a pre-defined cost function and a suite of link- 
state metrics, which behave as one of the key factors in the determination of 
network throughput. The link-state metrics can be the ratio between the total 
capacity and residual capacity, or the width of bottleneck for a given path. 
The cost function and link-state metrics are user-defined according to the 
objective of network design and the different arrangement and traffic pattern 
in the network. 

The other important factor to the throughput in performing the static 
RWA problem is the sequence of S-D pairs being considered. The static 
RWA problem is degraded to a dynamic path selection process mentioned if 
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we deal with each connection request one after the other. Different from the 
dynamic path selection and wavelength assignment, the sequence of S-D 
pairs to be considered can be manipulated to improve the throughput. Based 
on this characteristic, we can improve the network throughput by randomly 
shuffling the sequence of connection requests to be allocated. The final 
solution is from the trial that yields a best throughput among all the trials of 
different sequences. This is also termed a Statistical multi-trial approach, 
and will be further verified in Chapter 7. 

A variety of approaches to sorting and grouping the connection requests 
upon S-D pairs have been proposed based on the physical location and hop 
count of each lightpath. In terms of sorting, the most commonly seen 
strategies are listed as follows: 

1. Randomly Sequential Shortest path: permutation is generated randomly 
with uniform probability. 

2. Minimum Hop: requests for setting up lightpath are sorted according to 
the hop count of each lightpath. The lightpath with the smallest hop count 
is allocated first. 

3. Maximum Request: requests are sorted by the minimum hop count 
needed by the lightpath allocation requests. Longer lightpaths are 
allocated first. 

The above three criteria can also be applicable when the network is 
required to generate a maximum revenue if the revenue of the connection 
requests for each S-D pair is defined. In the meantime, another design 
strategy out of the above three can be set such that a lightpath with higher 
revenue is allocated first. In general, with the goal of allocating as many 
lightpaths as possible, the second strategy suffers from a lack of fairness, 
although it can yield a much better throughput than the other strategies. 
There is no reason to allocate all the connection requests with shorter 
distance (in hop), while leaving the longer coimection requests to be 
blocked. 

Although the ILP optimization process can yield the best throughput, it 
generally consumes a long computation time. Sequentially allocating 
lightpaths for each S-D pair into the network is faster, however the 
throughput is diminished. A compromise can be found between the two 
extreme cases, namely, that the lightpaths to be allocated be divided into 
several sub-groups. Lightpath setup requests in each sub-group are 
optimized as a whole, while the optimization result of each sub-group is 
allocated into the network one after the other. The larger a sub-group is, the 
better the optimization quality could be and the longer duration of time it 
may consume. 

The grouping of lightpath allocation requests can be based on a specific 
criterion such as the physical location (i.e., location of the S-D pairs), or the 
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minimum distance of a lightpath, or the revenue generated by setting up a 
lightpath. For the criterion of physical location, the lightpath allocation 
requests on the same S-D pair could be grouped together. For the criterion of 
minimum distance of a lightpath, each sub-group could randomly select a 
fixed amount of lightpath allocation requests. For the revenue defined for 
each lightpath, the policy may sub-group lightpaths with higher revenues and 
allocates those lightpaths at a higher priority. 

3.6 Summary 

This chapter has presented several topics in the RWA problem for a 
network with/without wavelength conversion, and with either a dynamic or 
static traffic pattern. The issues of network design and modeling, and the 
objectives and principles for satisfying versatile routing constraints and 
requirements, have been outlined. The constraints imposed on a wavelength- 
routed optical network were presented in Section 3.2, and included the topics 
of the physical constraint and the diversity constraint. Section 3.3 provided 
an approach for modeling optical networks with and without wavelength 
conversion capability. The model using a wavelength graph technique can be 
applied to networks that contain nodes with different wavelength conversion 
capability and ranges. An approach for optimal routing using a modified 
Dijkstra’s algorithm was introduced. Section 3.4 provided an overview of 
the dynamic path selection schemes in which the fully-adaptive and 
partially-adaptive routing schemes were discussed. The link-state coherence 
problem was also introduced and covered the topics of centralized, 
distributed, and semi-centralized control schemes. Section 3.5 outlined the 
RWA schemes with static traffic, which contains the ILP formulations and 
heuristic approaches for solving the problem. 
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Questions 

1. Describe three most commonly seen constraints on the routing and 
wavelength assignment in WDM networks. Explain the origin and causes 
of these constraints. 

2. What is “domain of transparency”? Do you agree that each node in the 
network may belong to multiple domains of transparency? Explain your 
reasoning. 

3. List three physical impairments in the optical network layer. Which are 
linear and which are non-linear? Explain what a non-linear physical 
impairment is. 

4. Why do we need paths to be diversely routed? 

5. List three types of networks according to the distribution and 
homogeneity of wavelength conversion capability. 

6. Why do the wavelength graphs need extra nodes for reflecting the 
wavelength conversion capability in each base node? What are the pros 
and cons? 

7. Write down the modified Dijkstra’s algorithm for optimal routing of 
lightpaths using the wavelength graph with extra nodes. 

8. Explain why the computation complexity of using Dijkstra’s algorithm to 
solve the least-cost path selection problem is 0{N^)\ and also explain 
why the computation complexity is 0((|w|/J + A^)-((|w|-7)- AT)) , 
0((|w| • AT -h £■) • A^) , and 0((|w|+Ar)'(|w|-A^)) for the d)mamic path 
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selection processes with wavelength graph techniques as we use RCF, the 
extra-node method, and SPAWG, respectively. 

9. Compare the computation complexity of RCF, SPAWG, and the extra- 
node method. What is the upper bound and lower bound for the schemes 
given different arrangements of wavelength conversion in each node? 
(Hint: discuss the cases with full wavelength convertible in each node 
and with the arrangement shown in Fig. 3.5). 

10. Can the RCF scheme find least-cost lightpaths? In what circumstance can 
the RCF guarantee to find an optimal solution? Explain your answer. 

11. Compare and contrast the three-wavelength graph modeling approaches: 
simple WG, WG with extra nodes, and WG with modified Dijkstra’s 
algorithm. 

12. What are the advantages and disadvantages of using centralized or 
distributed control? 

13. Define adaptive routing. What is the difference between fully-adaptive 
and partially-adaptive routing schemes? Describe how they are managed 
by citing an example. 

14. Describe the functional principle of the following wavelength assignment 
schemes: First Fit, Random Fit, Fixed Path Least Congested, Most Used, 
PACK, SPREAD, Minimum Sum, and Adaptive Dynamic Routing with x 
pre-determined routes (ADR-x). 

15. Why do we need to sometimes sub-divide a network into several sub- 
networks for link-state coordination? 

16. Why do we need to use heuristics to solve the RWA problem with static 
traffic? Explain the meanings of the Integer Linear Programming 
formulation in Section 3.5. 




Chapter 4 



ALGORITHMS FOR DYNAMIC ROUTING 
AND WAVELENGTH ASSIGNMENT 



In this chapter we present topics of dynamic routing and wavelength 
assignment, with a special focus on performance evaluation. We will first 
introduce a number of network planning algorithms to facilitate solving the 
dynamic RWA problem, which include a d)mamic link-state metrics called 
the Weighted Network Link-state (W-NLS), and a scheme for deploying 
alternate paths (or APs in the following context) into networks under Fixed 
Alternate Routing structure. To inq)rove network performance, two RWA 
algorithms based on the above planning algorithms are presented, namely 
Minimum Interference Routing Algorithm (MIRA) and Asynchronous 
Criticality Avoidance (ACA) protocol, which aim at minimizing the 
interference of allocating the current connection request to subsequent 
requests. The two algorithms achieve load-balancing through different ways: 
one uses minimum cut theory to derive a new link weight, and the other 
deploys APs with criticality information dissemination. Both algorithms 
make routing decisions by taking the network topology, the potential traffic 
loads, and the location of each S-D pair into consideration. 

4.1 Network Planning Algorithms 

In the optical networks where connection requests arrive one-by-one 
without any knowledge of future arrivals (termed a d3mamic network in the 
following context), a path selection process should be designed not only to 
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find a lightpath for current traffic demand, but also to maximize the 
probability of successfully allocating subsequent connection requests. In 
order to make the most use of network resources, any traffic bottleneck that 
may fragment network resources should be avoided as much as possible. The 
task can be realized by using Dijkstra’s shortest path first algorithm along 
with well-designed dynamic link-state metrics (e.g., the maximum reservable 
bandwidth defined in OSPF) [1] to find the least-loaded path for each 
connection request. However, when the network traffic demand is not 
homogeneous for each S-D pair, the balancing of traffic distribution may be 
uneven if no further elaboration is made. A simple example is shown in Fig. 
4.1. Assume that each node-pair has a uniform traffic load. It is clear that a 
traffic bottleneck can easily occur between node A and B. Therefore, when a 
connection request arrives at the node-pair (C, G), although the path C-A-B- 
G is the shortest (in terms of hop count) and may also be the best path if any 
adaptive routing scheme (e.g., Dijkstra’s shortest path first) and dynamic 
link-state has been adopted, the path C-D-E-F-G is undoubtedly a better 
choice in this specific topology. Now if the traffic load is not uniform for 
each node-pair, the path selection process should also take this factor into 
consideration. For example, if node H, I, J, K, L, and M may only launch 
very tiny amount of traffic between each other compared with the other 
node-pairs, we should put more priority to the path C-A-B-G when a 
coimection request arrives between node C and G to avoid a potential 
bottleneck that noay be easily generated along the links C-D, D-E, E-F, or F- 
G. From the above example, we can see that the current link-state is not the 
only factor that needs to be considered in a path selection process, but the 
physical location and potential traffic load of each source-destination (S-D) 
pair are also important. 

In this section, two network planning algorithms are introduced to 
facilitate load-balancing, where the physical topologies of networks, as well 
as the location and traffic load of each S-D pair, are taken into consideration. 
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Fig. 4.1. This example demonstrates that the network topology should be 
taken into consideration as routing is performed. 

4.1.1 Weig{ited Network Link-State 

Load-balancing is always an issue in improving network throughput. 
Traffic distribution is balanced in the networks so that no link over- or 
under-utilization takes place at any time. While OSPF has been extended 
with traffic engineering link-states (e.g., maximum reservable bandwidth, 
etc), it is still extremely hard to solve this network-wide problem with these 
link costs. In addition to the link-state parameters, the network topology, the 
location of each S-D pair, and the load of each S-D pair can also play a key 
role in determining the traffic distribution in the network. In this sub-section, 
a link metric called the Weighted Network Link-state (W-NLS) is 
introduced, to jointly quantify the above three criteria. 

The dynamic networks may have different traffic loads on different S-D 
pairs, which launch traffic into the network links according to a specific 
routing algorithm. Each link in the network behaves as a server that accepts 
and releases its clients (i.e., the occupancy of wavelength chaimels along the 
link) at specific arrival and departure rates. Since the arrival and departure 
rates of connection requests follow a Poisson model, and are independent of 
different links, we can model each link as an M/M/K/K lossy system. The 
traffic load along a network link is a linear combination of the contributions 
by all the S-D pairs in the network. 

The physical route taken by a connection request between an S-D pair is 
most frequently among a collection of routes (e.g., the shortest or the second 
shortest paths in hop count). The traffic from each S-D pair is launched onto 
the network through these paths. Therefore, we can approximate the 
potential traffic load across each directional link by summing up all the 
traffic contribution to the link from each S-D pair through the specific 
collection of routes (also called Hypothetical paths in the following context). 
According to the path selection algorithm adopted, the hypothetical paths for 
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each S-D pair should be different. For Fixed Alternate Routing, the 
hypothetical paths can simply be the pre-defmed alternate paths for each S-D 
pair. Each one of the alternate paths connecting between an S-D pair has an 
equal tendency to be adopted when a connection request arrives. However, 
for the fully-adaptive routing, the number of hypothetical paths may be very 
large so that it is not possible to consider all possibilities. To cope with this 
situation, we can approximately take a specific group of routes (e.g., all the 
shortest paths in hop count) for calculating the potential traffic load across 
each directional link, which is a linear combination of all traffic loads issued 
from the S-D pairs with hypothetical paths passing through the directional 
link. 

Based on the potential traffic load of a directional link, we define w^j^oi 

link a-b as the W-NLS, which reflects the importance of link a-b in terms of 
its physical position and traffic load issues by each S-D pair. The W-NLS is 
expressed in the following relationship: 



w. 



a,b 









M, 



Itjyfn 



'•J 



where A, ^is the potential traffic load between S-D pair (ij), Mij is the pre- 
planned path set between the S-D pair (iJ), |m ^ ^ | is the number of paths in 
Mij, and a. ^ is the weighting on the mth pre-plaimed route of the S-D pair 
(iJ). The binary path-link incidence index d“f^ is defined as: 



_ I ^ of S - D pair (i, j) passes link a-b 

1 0 otherwise 



The hypothetical paths of each S-D pair for calculating Wy can be all the 
shortest paths in hop count, or all the shortest paths as well as some non- 
shortest paths. The parameter j „ is a weighting on the tendency of the mth 



hypothetical path to be taken. In general, the shortest paths in hop count 
between an S-D pair is used much more often than other longer paths; 

therefore, Ct,- , „ can be set such as • „, = -^ or Of, , „ = , where k„ 

»y> 1 L 



is the hop count offset of the mth path from the shortest one, or 



jl if*„=0 ^ 

[O otherwise 
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where only the shortest paths are considered. 

We have an example of showing how to derive W-NLS in Fig. 4.2 with the 

above equation as the criterion to determinett; ^ . There are four shortest 

paths in hop count between node 3 and node 13. The traffic in the network 
contributed by the S-D pair (3, 13) is highly likely to load onto the links 
along the four shortest paths. If there is no preference on the path selection, 
each of the four paths could be equally weighted (i.e., a,.j „ = 1 and 

|m,.^| = 4). Applying this process to all S-D pairs in the network, Wa,b is 

derived by a linear combination of the traffic load from all the S-D pairs that 
are likely to traverse it. 




Fig. 4.2. An example for deriving the Weighted network link-state (W- 
NLS) for the link (i, j) by a process of linear combination on the 
contributions from all the S-D pairs that are likely to traverse the link. 

In some cases the hypothetical paths for each S-D pair should not be 
limited to the shortest paths but also the sub-optimal paths, depending on the 
routing strategy adopted and how much the network is loaded. If the network 
load is heavy so that the average length of paths is long, considering the 
second shortest paths may improve the accuracy of using W-NLS to quantify 
the importance of a directional link. 

A simulation was conducted for verifying the algorithms of the W-NLS 
under the cases with different number of hypothetical paths using the 16- 
node and 22-node networks shown in Fig. 4.2 and Fig. 4.3. The number of 
wavelength chaimels in a fiber is 16, and the number of fibers in a 
directional link 5. The number of coimection requests for a trial is 60,000, 
and the final data is an average of the results of 4 trials. The routing schemes 
adopted for comparison in the simulation are First Fit (FF) [2,3,5], Random 
Fit (RF) [2,3,5], Fixed-Paths Least-Congested (FPLC) (or Least Loaded) 
[3,4], and Adaptive Dynamic Routing (ADR) [1,3]. In the FF scheme, the 
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searching of a lightpath follows a fixed order of pre-planned APs and the 
wavelength planes on each of the paths. The decision is made once a feasible 
lightpath is found. In the RF scheme, the working lightpath is randomly 
chosen among all the feasible lightpaths in each of the wavelength planes 
along each alternate path. In the FPLC, the lightpath that is least congested 
in the most heavily loaded link gets the highest priority. The W-NLS is 
adopted when two or more feasible lightpaths with the same priority (or 
equally congested in their most heavily loaded link) are found. For the ADR 
scheme, the cost function that does or does not cooperate with the proposed 
W-NLS is shown as follows: 

C„h = 1 + — %— for the case where the W-NLS is not considered; and 
. F 

C„h = 1 + r- for the case where the W-NLS is considered. 

BBlj, 

In the above, F is the number of fibers and is the number of 

available wavelength channels on the kth wavelength plane. When a 
connection request arrives, the Dijkstra’s shortest path first algorithm is 
performed on each wavelength plane with the link cost derived by the above 
cost function. Table 4.1 lists the symbols of experiments with different 
arrangement of alternate paths in this study. 





Alternate Paths 


1 


2D: A link-disjoint path-pair with shortest total length 


2 


Min_Path(S, D, x), where x = 0* for all S-D pairs 


3 


Min_Path(S, D, x), where x = 0~1* for all S-D pairs 


4 


Min_Path(S, D, x), where x = 0-2* for all S-D pairs 


5 


Min_Path(S, D, x), where x = 0-3* for all S-D pairs 


6 


Min_Path(S, D, x), where x = 0-4* for all S-D pairs 



Table 4.1. The experiments for verifying the Weighted network link-state 
*x = 0 ~ n means that the alternate paths are the shortest, second shortest,.. ,and the 
nth shortest paths between an S-D pair 
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6 




Fig. 4.3. The 22-node network 



The algorithms for deriving the loop-less k-shortest paths in this 
simulation can be seen in Appendix A. The algorithm used to derive the 
link-disjoint path-pairs with the shortest length in total will be introduced in 
Chapter 8. We define that Wa,b for all {a,b) derived with the hypothetical 
paths of all the shortest paths as “weighted 0”, and the case with all the 
shortest paths and the second shortest paths as “weighted 1”, etc. 

From the experiment results, ADR weighted 0 outperforms FF, RF, and 
ADR, and has a similar performance to FPLC in some cases. It is clear that 
having a proper number of alternate paths can achieve a best performance 
with FF, RF, and FPLC. Let us take a closer look at the result of each 
experiment. 

The simulation results with the FF scheme for the 22-node network are 
shown in Fig. 4.4. The performance of the case x = 0~2 yields a best 
performance among all the cases of FPLC. In the 16-node network, the cases 
of FF with X = 0~2, x = 0~3, and 2D have a similar performance. 

Simulation results with the RF scheme are shown in Fig. 4.5, where x = 
0~1 in the 16-node network, and x = 0~2 in the 22-node network, yield the 
best performance. Providing longer alternate paths cannot improve 
performance due to the fact that the RF scheme selects a working lightpath 
randomly from all the feasible lightpaths. The lengthy candidate lightpaths 
may not only decrease the blocking rate, but also increase the average 
consumption of network resources by a connection. 

The simulation results for the case FPLC is shown in Fig. 4.6. Except for 
the case x = 0, there is no significant difference between each case in both 
16-node and 22-node networks. 
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Blocking Prdb. vs. Load (16-node of FF and ADR) 
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Blocking Prcto. vs. Load (22-node of FF and ADR) 



100 . 00 % 



vveightedO 



(a) 



Fig. 4.4(a), (b). The simulation results of using the First-Fit (FF) RWA 
scheme under different number of alternate paths in the 16-node (a) and 22- 
node (b) networks. 
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Fig. 4.5(a), (b). The simulation results of using the Random Fit (RF) 
scheme under different number of alternate paths in the 16-node (a) 
and 22-node (b) networks. 



Fig. 4.7(a) and Fig. 4.7(b) show the comparison among different routing 
schemes in the 16-node and 22-node network, respectively. We select the 
cases with an arrangement of alternate paths that yield best performance in 
each RWA scheme (i.e., ADR, FF, RF, and FPLC). In the 22-node network, 
the ADR with weighted 0 outperforms the other RWA schemes, while in the 
16-node network there is almost no difference in performance due to the fact 
that the 16-node network is small in size so that the effects by using different 
routing strategies are not significant. 
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Blocking Prob, va Load (FPLC 1 6 -node) 
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(a) 



Blocking Prob. vs. Load (FPLC 22<node) 




Load 
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weighted 0 1 



(b) 



Fig. 4.6(a), (b). The simulation results of using the Fixed-Paths Least 
Congested (FPLC) RWA scheme under different number of alternate paths 
in the 16-node (a) and 22-node (b) networks. 
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Blocking Prob. vs. Load (l 6-node) 




Load 



(a) 




Fig. 4.7(a), (b). Comparison between the RWA schemes FPLC x = 0~2, 
FF X = 0~3, RF X = 0~1, and ADR weighted 0 in the 16-node and 22-node 
random networks. 

4.1.2 Planning for Fixed Alternate Routing 

Here we will focus on solving the problem of arranging alternate paths 
(abbreviated as APs) for each S-D pair, and how the different patterns of 
APs influence the performance behavior. 

In Section 4.1.1 we conducted a comparative study of the performance, in 
terms of blocking probability, between the cases with different arrangement 
of APs, including the case of 2D and a complete set of k-shortest paths 
between each S-D pair. The simulation showed that using a moderate 
number of k-shortest paths as APs with a limited length in hop count can 
achieve the best performance. However, using a large number of APs may 
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require a large amount of probing traffic and computation efforts. The 
simulation conducted for verifying the W-NLS did not consider the efforts of 
maintaining those APs and the cost (i.e., computation latency, extra control 
packets, and computation resources at source and destination nodes, etc.) of 
dealing with hundreds or even thousands of APs when a connection request 
arrives. In addition, since the use of k-shortest paths as alternatives did not 
consider the fact that network resources along an AP may highly overlap 
with the others between an S-D pair, a further elaboration on the selection of 
APs can simply reduce the cost in control and management. An example is 
shown in Fig. 4.8(a). With PI and P2 as APs between node A and B 
obviously yields a better performance than the pattern given in Fig. 4.8(b), 
where an exhaustion of bandwidth on the links between node C and B can 
block both PI and P2. To analytically verify the above statement, we assume 
that the probability for a link to be blocked is B regardless of where the link 
is. The blocking probability is 1-[1-(1-(1-B)^)^] (1-B)^ for the case in 
Fig. 4.8(a), and [i-(i-B)^]^ for the case in Fig. 4.8(b). Fig. 4.8(c) shows that 
with the same probing and computing efforts, the diverse deployment of APs 
between an S-D pair increases the odds of having a successful path selection. 

From another point of view, if the APs of all the S-D pairs in the network 
are not spread homogeneously, a traffic bottleneck can be generated more 
often. An example is shown in Fig. 4.9(a) and Fig. 4.9(b), where two 
arrangement patterns of APs for S-D pairs Sl-Dl and S2-D2 are set up. With 
the same traffic load on each S-D pair, the pattern in Fig. 4.9(a) yields a 
better performance than the case in Fig. 4.9(b) as it spreads the APs more 
homogeneously onto the whole network, where the total working capacity 
for the two S-D pairs is larger. Note that we assume there is no other S-D 
p air in the network that can launch traffic at any time. 

In the following sub-sections, we introduce an approach for deriving an 
optimal deployment of APs for each S-D pair, called Capacity-Balanced 
Alternate Routing (C-BAR). C-BAR helps achieve a maximum diversity of 
deployment of APs between each S-D pair and homogeneous distribution of 
the APs in the whole network. In order to prevent the scalability problem 
when using Quadrature Integer Programming (QIP), a heuristic algorithm is 
presented as an approximation of the optimal solution. 
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Blocking Performance versus Probability of Link 
Congestion with Different Deployment of APs 
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Fig. 4.8. An example to show the performance improvement by properly 
arranging APs between node A and B. 




D1 




(b) 



Fig. 4.9. An example to show different arrangements of AP set for the S- 
D pairs Sl-Dl and S2-D2. 

4.1.2.1 Assumptions in Network Environment Modeling 

We assume that loop-less k shortest paths from each source to all its 
possible destinations have been prepared in the routing table of every source 
node. We also assume that connection requests follow Poisson arrivals and 
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departures. Without loss of generality, the departure rate of each S-D pair is 
set to 1. A link is directional with a fixed number of fibers (i.e., F), which 
accommodates a fixed number of wavelength planes (i.e., K). In other words, 
a directional link has F fibers or K wavelength planes. Since no wavelength 
conversion is allowed, it is notable that the wavelength planes in a fiber are 
not interchangeable in terms of switching function, while the wavelength 
channels in a wavelength plane are interchangeable (since they have the 
same wavelength). 

4.1.2.2 Optimization for Alternate Paths 

In brief, the C-BAR algorithm is used to pick up a specific number of 
paths fi:om the k-shortest paths in each node for coordinating to all the other 
end nodes as APs, with an objective of spreading the traffic load 
homogeneously (i.e., minimizing the deviation of potential load along each 
span). We formulate this optimization problem into a QIP process as 
follows: 



Minimize ^ i^a,b “ , 



(4.1) 



Subject to the following constraints: 



w 



a,b 



= s 

all(ij) 



Pi, 



i.J 
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a,b 



a,te9l ^ ^ 



a,b ’ 



(4.3) 





(4.4) 



where: 

Mij is the AP set (i.e., alternate path set) between ingress i and egress j, (or 
KPij is the k-shortest path set between (/,/); 
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Wa,b is the potential traffic load launched onto the span a-b; 

is a binary indicator that yields a “1” if the kth candidate path in KPij 

traverses span a-b, and “0” otherwise. The indicator can be determined right 
after the k-shortest path set for each node pair is given. 

E is the number of spans in the network; 

is a binary parameter that yields a “1” if node a and node b is adjacent, 

and a “0” otherwise; 

I^j is a binary indicator that is 1 if the kth path in KPij is taken as an AP 

between S-D pair (ij) and 0 otherwise. 

Eq. (4.1) is the target function of this QIP formulation, which is aimed at 

minimizing (w^ j, -w)^, or the deviation of potential traffic on link a-b. 

The two terms represent the two optimization objectives outlined at the 
beginning of this sub-section. Eq. (4.2) gives the traffic load launched onto 
link a-b. Eq. (4.3) is the average traffic load on each directional link. Eq. (4.4) 
describes the AP set My as selected from the k shortest path set KPij. 

An illustration of the above notation is given in Fig. 4.10 showing a pattern 
of arrangement. The pattern picks up the paths with dark lines as alternatives. 




kth shortest path 



Node j 



Node 



Fig. 4.10. An illustration of a pattern of arrangement of the APs between 
node i and j. 

4.1.2.3 Design of the Number of Alternate Paths 

The above discussion does not touch the problem of designing the number 
of APs between node i and j (i.e.,|M,^|). This section is aimed at this task. 

This section demonstrates a methodology for designing |m, ^| under a 

particular performance requirement in blocking probability. The design 
objective is to achieve the same (or at least a similar) blocking performance 
for the S-D pairs with different minimum distance. 

It has been reported that the longer an S-D pair is apart, the greater the 
likelihood of it being blocked [7]. In a simulation study. Fig. 4.11 shows the 
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histogram of hop count of the connection requests that were blocked during 
the simulation. It can be seen that the requests with longer minimum distance 
(in hop count) suffer a higher probability of blocking regardless of the time 
of the traffic load. 







Fig. 4.11. Experiment results showing that the S-D pairs with longer 
distance suffer from a high blocking rate. The network in Fig. 4.9(a) is 
adopted, using the Fixed-Paths Least-Congested (FPLC) scheme with 2 
shortest link-disjoint paths as alternatives between each S-D pair. 
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The above simulation results can also be observed through the following 
discussion. A successful setup of a lightpath is a concatenation of 
wavelength channels that are available and on the same wavelength plane 
along an AP. The blocking probability for the connection requests between 
node pair (i,j) can be simply formulated as; 



A 

nn 



1 - 



(.a,b)sAP„ 



(4.5) 



where OB(a,b),k is the probability of blocking for link a-b on wavelength plane 
k. 

Connection requests with long distance impair network performance in the 
following two ways: First, blocking is more possible, as discussed above. 
Second, long lightpaths consume more network resources, and as a result 
also increase the blocking probability of subsequent connection requests. If 
we block all the connection requests with longer distance (e.g., all the 
connection requests with a hop count larger than or equal to 4 in the 
experiment demonstrated above), the overall performance, in terms of 
blocking probability, can be improved immediately. However, it does not 
make any sense for a service provider to give up all those connection 
requests with long distance. In this chapter, the design objective is to ensure 
that every connection request is subject to the same blocking rate (or as close 
as possible). Under this requirement, we expect that the S-D pairs with 
longer minimum distance should be assigned with more APs than those of 
the S-D pairs with shorter distance. 

In the following discussions, we will derive according to Hij (i.e., 

the minimum distance between node i and j) through an approach of 
modeling the traffic across each wavelength plane. The reason we did not 
model traffic in a fiber is that the wavelength channels in a fiber must be 
subject to the wavelength continuity constraint, while the channels in a 
wavelength plane are interchangeable and can simply go through the 
analysis of a queueing modeling. Studies on analytical modeling in blocking 
probability for the wavelength-routed WDM networks have been extensively 
reported in the past. However, they are limited to single-fiber networks. We 
will demonstrate an approach for analytical modeling of performance in 
multi-fiber WDM networks. 

We assume that no matter what value of M . J is taken for each S-D pair, 

after the optimization process by the QIP formulation each link has the same 
(or similar) potential traffic load, which is equivalent to having the same (or 
similar) probability of occupancy. In other words, we can take Wa,b = w for 
all adjacent (a,b) and OB = OB(a,b),k for all wavelength plane k and adjacent 
(a,b). We also assume that the traffic load across a link will be 
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homogeneously distributed to each wavelength plane on that link. Therefore, 
the potential traffic load on a wavelength link (i.e., p ) is the total traffic load 

across the directional link divided by the number of wavelength planes 

w 

(i.e., p = — ). Since a wavelength link follows the behavior of an M/M/K/K 
K 

lossy system, the relationship between OB and p is given by the Erlang’s 
loss formula as shown below [9]: 



OB, 



(a,b),k 



p^/F\ 

F 

Stt 

/=o J- 



for all adjacent (a,b) and k. 



(4.6) 



where F is the number of lightpaths that can be accommodated in a 
wavelength link, and is the probability for the wavelength plane k 

on link a-b to be fully occupied (i.e., all F wavelength channels on 
wavelength plane k is occupied). 

We assume that APs between each S-D pair are link-disjoint and have a 
minimum hop count. Eq. (4.5) can be simplified as: 

PB, j = t - (1 - OBf'^ , (4.7) 

where is the blocking probability for the lightpath connection requests 
between node i and j. FromEq. (4.7), \m^ j^ is derived as: 




ln(Pg,,) 



(4.8) 



Assume that a link has 5 fibers (i.e., F = 5) in one direction, and that each 
fiber has 16 wavelength channels (i.e., K = 16). In other words, a directional 
link has 16 wavelength links, each of which contains 5 channels. Fig. 4.12 
models the traffic across a wavelength plane. In Fig. 4.12(a), the traffic load 
upon link a-b (i.e., iv ) is evenly distributed to each wavelength plane. 

w L vv 

Therefore, the traffic load to each wavelength plane is /? = — ^ = — for all 

K K 

(a,b). Fig. 4.12(b) shows that a wavelength plane is modeled as an Erlang’s 
loss system with 5 servers. 
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Arrival rate: 



There areK' wavelength planes 












(a) 



A wavelength plane / /■ / 
on link a-b 



1 ;! ! ^ \ Departure 

rate = 1 



1 


2 


3 


4 





t 



p =^^= — 

K K 



(b) 



Fig. 4.12. Modeling for a wavelength plane on link a-b. (a) The traffic load 
of a directional link is w^j, = w for all adjacent (a,b), which is defined in 

Eq. (4.2). (b) Modeling of a wavelength plane as an Erlang’s loss system. 

w b 

The arrival and departure rate of a lightpath is modeled as p = — ^ and 1, 

K 

respectively. 



We set up the design such that the S-D pairs with the longest distance (i.e., 
Hij = 5) in the sample networks (in Fig. 4.9) have = 4 with PBij less 

than or equal to Pq . Let Pq = 0.5%, 1%, 2%, 5% or 10%. Using Eq. (4.7), 
we get the corresponding wavelength plane occupancy B = 0.397, 0.413, 
0.432, 0.460, and 0.488 from Fig. 4.13(a), which is a plot of Pq versus B. 
Then Eq. (4.6) is used to plot p versus OB in Fig. 4.13(b). Therefore, traffic 
load across a wavelength plane is derived as p = 6.57, 6.78, 7.12, 7.64, and 
8.17 corresponding to OB = 0.397, 0.413, 0.432, 0.460, and 0.488. Then by 
using Eq. (4.8), the relationship between and Hy under different 

traffic load across a wavelength plane is illustrated in Fig. 4.13(c). Note that 
each of the curves in Fig. 4.13(c) describes the minimum number of 

required to achieve the pre-defined blocking performance requirements (i.e., 
PBij = 0.5%, 1%, 2%, 5% or 10%) for all (ij), which is also summarized in 
Table 4.2. 
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Plot of M (number of APs) versus H (minimum distance between a C-S pair) 




Fig. 4.13. (a) Relationship between Pq and B, where is the blocking 

probability for the S-D pairs with | = 4 and Hij = 5. P^ is initialized 

as 0.5%, 1%, 2%, 5%, and 10%. (b) Relationship between p (i.e., 
probability of congestion for a wavelength plane) and B. The corresponding 
values of p for Pq can be derived, (c) Plot for [ for S-D pairs with 

different minimum distance under a specific blocking performance 
requirement. 



We take the case of — = 6 , which yields an approximate link utilization 
K 

(JJT) of 76.8% using Eq. (4.7). Table 3.2 shows the number of APs (|m,. ^ | ). 
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Table 4.2. The estimated number of APs for the S-D pairs with different 
minimum distance for achieving a specific blocking probability. 
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In the above analysis, the reason for not setting up the design with lower 
blocking probability by increasing is briefly stated as follows. We 

have made the assumption that the APs between a S-D pair are link-disjoint 
and with a minimum distance. However, as |m,^| is increasing, the 
assumption is more distorted from a practical situation. Therefore, the value 
of |m, ^1 has to be within a reasonable scope (e.g., constraint of nodal degree) 

so that the overlapping between APs of an S-D pair can be possibly 
negligible. Note that the design approach and the derived results shown in 
Table 4.2 are generally for all densely meshed WDM networks. The input 
parameters to the analysis are the desired blocking performance (i.e., PBij) 
and the number of APs available to those S-D pairs with the longest distance 
in the networks. The information of network topology and traffic pattern are 
imposed to the design through the QIP formulation in the previous paragraph. 



4.1.2.4 A Heuristic Algorithm 

Although the optimization of QIP provides a global optimization for 
deploying the APs for each S-D pair, the QIP optimization process obviously 
suffers from a scalability problem. Here a large number of k-shortest paths 
and required APs between each S-D pair may make the calculation elapse 
forever. It can be easily observed that the number of k-shortest paths 
between each node pair in the above QIP formulation dominates the 
computation complexity. Besides, the N-square property of any additional 
node expanded to the network may enlarge the searching space 
exponentially. To reduce computation complexity, a heuristic algorithm is 
proposed to solve the problem by sequentially performing the optimization 
for each S-D pair, which reduces the computation complexity from 



o(n 





)toO(2 





In this case, the complexity is linearly 



increased as the number of S-D pairs is increased. 

The S-D pairs are pushed into a stack, SP, with a random sequence. Each 

S-D pair is assigned a |Af,- J according to Table 4.2 with a specific 



performance requirement and network environment. The algorithm starts by 
popping each S-D pair from the stack and processing the S-D pair with the 
same optimization objective and constraints outlined from Eq. (4.1) to Eq. 
(4.4). In this study, all the shortest, second shortest, and third shortest paths 
are adopted as the k shortest path set for each S-D pair. After a S-D pair is 
completed, the result (i.e., the arrangement of APs for the S-D pair; and the 
array, W, for storing Wa,b) is recorded. The optimization for the subsequent S- 
D pair in the stack has to be based on W, which keeps the potential traffic 
load on all the network links at that moment. The iterative process does not 
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stop until the stack is empty. Fig. 4.14 illustrates the flowchart for 
summarizing the heuristic algorithm. 

It is clear that the sequence of S-D pairs in the stack influences the results. 
To improve optimality, the heuristic is performed multiple times with 
different sequences of S-D pairs processed under the maximum allowable 
time limitation. This approach is termed a “Statistical multi-trial approach”, 
and will be discussed in detail in Chapter 7. The final result is the one with 
the lowest cost. 




Fig. 4.14. Flowchart of the heuristic algorithm for deriving APs for each 
S-D pair 

4.1,2.5 Verification of the Algorithm 
Table 4.3 compares statistics by using different strategies to plan APs of 
each S-D pair. The potential traffic load for each S-D pair is randomly 
distributed with ^ =7 plus a random fluctuation (±50%). With this, we 

take |m, J as 1, 1, 2, 3, and 4 for the S-D pairs with Hij = 1, 2, 3, 4, and 5, 

respectively. The path set KP,j for node pair (i,/) needed in the QIP 
formulation is composed of all the shortest, second shortest, and third 
shortest paths in hops between node i and j. The normalized deviation ND is 
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1 ^ II 

defined as: ND = j-t- V i_f£_ — L, where |L| is the number of links in the 

aU(aJ>) ^ 

network. The normalized extra length (nel) is defined as: 
1 ovg_length 

nel = V — , where avgjengthij and Hij is the 

no _SD aiiii.j) 



average length of APs and the minimum possible length between S-D pair 
(ij) in hops, respectively. 

We can see that C-BAR successfully distributes the potential traffic load to 
the whole network, and much better than schemes SI, S2 and RD. In 
addition, C-BAR can bring more marginal benefit as the working capacity is 
increased (or the number of APs) between each S-D pair. The use of the 
heuristic algorithm still yields a significantly better arrangement of APs than 
that of SI, S2 and RD. We will further verify the performance advantage of 
using the C-BAR algorithm along with some other path selection schemes by 
launching dynamic traffic into the networks in the following sections. 







Normalized 
Deviation (ND) 


Normalized Extra 
Length (net) 


Average No. of APs 
for each S-D 


22- 

node 


C-BAR 


12.2% 


1.05 


2.18 


SI 


38.5% 


1.00 


2.46 


S2 


33.8% 


1.64 


11.85 


RD 


64.8% 


1.00 


1 


30- 

node 


C-BAR 


11.6% 


1.04 


2.31 


SI 


48.5% 


1.00 


2.88 


S2 


42.2% 


1.61 


13.18 


RD 


53.6% 


1.00 


1 



Table 4.3. A comparison between four planning strategies. “C-BAR” uses C- 
BAR algorithm with the heuristic introduced in this section. One hundred trials were 
performed before the final data was compiled. “SI ” uses all the shortest paths ( in 
hop count) as APs for each ingress-egress pair. “S2” uses all the shortest and 
second shortest paths as APs; “RD” randomly selects a shortest path as AP, which 
is degraded to the case of Fixed Routing; the data is derived by averaging 100 
results of random selections 



4.2 Routing and Wavelength Assignment with Criticality 
Avoidance 

We will introduce two algorithms in this section, the Minimum 
Interference Routing Algorithm (MIRA) [8] and the Asynchronous 
Criticality Avoidance protocol (ACA), which perform criticality avoidance 
(or minimum interference) routing in optical networks. 
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4.2.1 MIRA 

MIRA derives critical links for each S-D pair in a fully adaptive manner. 
The objective of this scheme is to minimize the interference from the 
allocation of the current connection request to the subsequent potential 
traffic demand. 

An algorithm called Widest-Shortest Path First (WSP) strategy [8], which 
dynamically finds a feasible min-hop path between source and destination, 
such that the chosen min-hop path has the maximum residual link capacity in 
the bottleneck along the path. Instead of merely inspecting the shortest paths 
between an S-D pair, MIRA enhances WSP by using the maximum flow (or 
MaxFlow) algorithm (see Appendix B) to derive “critical links” in the 
network, in which both the traffic load and the location of each S-D are 
considered. 

The basic idea of MIRA is to find an explicit path for a coimection 
request, where the path has a minimum influence on the maximum flow of 
all the other S-D pairs in the network. Therefore, the explicit path can be 
derived by using a maximum flow algorithm invoked upon all the S-D pairs 
whenever a connection request arrives. In other words, the explicit path 
should yield minimum mutual interference with the maximum flows of the 
other S-D pairs, or maximize the smallest maximum flow value of each S-D 
pair. Let Z be the maximum flow value of each S-D pair. The above task can 
be formulated into an Integer Linear Programming process as follows. 



maximize Z 

NA D • V(S,D) e SD/(a, b) 


(4.9) 


NAx°'^=-B_oc 


a.b 

■ e ' 


(4.10) 


XS,D ^ ^ 


V(S,D)e SD/(a,b) 


(4.11) 


^ — ^S,D ' ^S,D 


V(S,D)g SD/(a,b) 


(4.12) 


x^'^>0 


V(S,D)g SD 


(4.13) 






(4.14) 



In the above formulation, SD is the set of node pairs that can launch 
traffic into the network, and SDI(a,b) is a node pair set containing all the 
S-D pairs except (a, 6). NA is a node-arc incidence matrix, which records the 
network topology. If an arc exist between node u and v, the element in the 
column corresponding to arc («,v) is +1 at the u-th row, and -1 at the v-th 
row. Let m denote the number of links and n denote the number of nodes, 
and is an /n-vector that depicts the links traversed by the flow between 
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node pair (S,D). Node pair {a,b) is where the current connection request 
arrives with a bandwidth B_oc. R is the bandwidth of a link. 6^ ^ is a scalar 
representing the maximum bandwidth that can be sent between S and D. 
a^ n is a weight on the traffic launched by node pair (S.D). Let 

represents an n-vector with a “+1” in position D and “-1” in S. 

Eq. (4.9) is the maximum flow problem for each of the S-D pairs in SD 
except the current pair (a,b). Note that 65 ^ represents the maximum flow 

value for the pair (S.D). Eq. (4.10) states that B_oc units of flow have to be 
sent between nodes a and b. Eq. (4.11) ties up the variables for these 
problems. It ensures that the maximum flow problems only utilize the arc 
capacities that are left over after routing the current demands. The non- 
negativity restrictions are specified in Eq. (4.13) and Eq. (4.14), which 
ensures that the demand is routed along a single path. 

The above formulation is subject to a drawback in that only the minimum 
maxflow between an S-D pair in the network influences the optimization. 
Therefore, the object function in the above formulation can also be replaced 
as: 



max Yj^s,d'^s,d 

(S,D)eP/(aJ>) 

However, this scheme is subject to a couple of fatal drawbacks. First, 
since solving the maximum flow problem is NP complete, it is very unlikely 
to be implemented for a real-time path selection purpose. Therefore, a 
heuristic was proposed in [ 8 ] by purely using Linear Programming 
Relaxation. If the programming process results in a non-splitting solution 
corresponding to the connection request, the solution for the connection 
request is the optimal. If the derived solution is in a form of split flows, 
which can never be feasible in optical networks, the algorithm may probably 
fail to find any path. Another drawback is that the algorithm cannot 
cooperate with any constraint imposed upon the path selection process, such 
as the path length in terms of hop count. Therefore, we have enough reasons 
to abandon the idea of deriving an explicit path using the Integer Linear 
Programming or Linear Programming Relaxation formulation. 

A better alternative is to determine appropriate weights for each link in 
the network, and then route the demand along the weighted shortest path. 
The weight of a link represents the “criticality” to the potential future 
demand if an incremental amount of the current demand (or per unit of 
bandwidth demand by the current request) is routed on this link. Therefore, 
the algorithm first computes the maximum flow values for each S-D pan- 
launched onto each network link for deriving the minimum-cut of each S-D 
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pair. The links that are found as the minimum cut between the S-D pair (S,D) 
are gathered in the critical link set, Cs,d- The weight on link l,wei{l) , is the 
linear combination of all the critical links Cs.d if I e Cj p , i.e., 



wei(l) = . 

. (.S,D):leC^ 



where a^ pis the potential traffic load between S and D. After defining the 

weight for each link, Dijkstra’s shortest path first algorithm is invoked on the 
reduced network. MIRA is compared with min-hop and WSP to verify its 
superiority [8]. It is reported that MIRA significantly outperforms the other 
two routing algorithms. 

MIRA has proved that taking the network topology, traffic load and 
location of each S-D pair into consideration in the routing process, can 
improve the network performance. However, the computation complexity of 
calculating the links with minimum cut between two nodes for determining 
the critical links is 0{N^ ^^fE -No_SD) [10], where No_SD is the number of 
S-D pair in the network, N is the number of nodes, and E is the number of 
links in the network. The above calculation needs to be performed in an on- 
line fashion and may impair network dynamicity. The timely computation 
limits the algorithm in the implementation of networks with dynamic traffic. 

The following descriptions give a high-level view of the MIRA. 

INPUT: 

A network with a set R of all residual link capacities; a node pair {a,b) upon 
which a connection request with flow of B_oc units have to be routed. 
OUTPUT: 

A route between a and b having a capacity of B units of bandwidth. 
ALGORITHM: 

1. Compute the maximum flow values for all (5, D) 6 SD l(a, b) . 

2. Compute the set of critical links Q £, for all (S,D)g SD /(a,b) . 

3. Compute the weights wei(l) = ^0!sd 

{s,d)MC^ 

4. Eliminate all links which have a residual bandwidth of less than B_oc and 
form a reduced network. 

5. Using Dijkstra’s algorithm to compute the shortest path in the reduced 
network using w{l) as the weight on link 1. 

6. Route the demand of B_oc units from a to ^ along this shortest path and 
update the residual capacities. 
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4.2.2 Asynchronous Criticality Avoidance (ACA) Protocol 

This section introduces a signaling/routing protocol, called Asynchronous 
Criticality Avoidance (ACA), with design objectives to reduce fragmentation 
of network resources and improve the overall performance. Unlike MIRA, 
the ACA protocol is implemented with the architecture of Fixed Alternate 
Routing. With the ACA protocol, in addition to using dynamic link-state 
metrics to evaluate each physical route and wavelength plane, the cost of 
taking a lightpath is also determined by how to minimize the interference to 
the future arrival of connection requests, in which “critical channel(s)” are 
defined for each S-D pair. The selection of lightpaths is required to avoid 
reserving these critical channels, which has a similar idea to MIRA. On the 
other hand, the ACA protocol deals with the most of the optimization tasks 
off-line and allows the decision making process completed in a short time 
with small amount of computation. As a result, the problem of dynamic 
RWA can be solved with a better provisioning speed and practical 
implementability. 

In dynamic WDM mesh networks without wavelength conversion, 
spreading traffic homogeneously onto a whole network while avoiding 
resource fragmentation in terms of wavelength chaimel utilization is the 
most critical issue for improving network throughput. The above two tasks 
are also referred to as routing and wavelength assignment processes, 
respectively. In the case where a directional link in the network contains 
multiple fibers, a single wavelength plane along a link in the network 
contains multiple interchangeable wavelength channels during a path 
selection process. In such a circumstance, a dilemma emerges: a “SPREAD” 
of working capacity onto a whole network can achieve better load-balancing 
on a wavelength plane; however, resource fragmentation between different 
wavelength planes may occur. A “PACK” of working capacity can eliminate 
resource fragmentation between different wavelength planes; however, the 
physical load-balancing characteristic is lost. 

Since provisioning latency needs to be less than several hundred milli- 
seconds, the path selection process needs to be very time-efficient, for which 
global optimization is totally impossible. Therefore, most reported schemes 
separate routing and wavelength assignment processes into two steps to 
s imp lify the RWA process (i.e., physical route is selected followed by a 
wavelength assignment process). The ACA protocol contains two basic 
functional elements; inter-arrival planning and a RWA process. The basic 
idea of the ACA protocol is to mi nimize the interference between flows 
issued by different S-D pairs. This topic has been fully explored in the 
previous sub-section, where the optimization of capacity deployment was 
formulated into a multi-commodity maximum flow problem. The main 




93 



contribution of the ACA protocol lies on providing a distribution solution to 
approximate the effect of mutual interference between different flows. 

4.2.2.1 Inter- Arrival Planning 

Inter-arrival planning is defined as planning efforts made between two 
network events (i.e., a request for setting up or tearing down a lightpath) to 
support distributed control process and to increase the probability of 
successfully allocating the next connection call. Successful inter-arrival 
planning starts right after the completion of a network event, and has to be 
finished before the arrival of the next network event. To implement the ACA 
protocol, a series of signaling mechanisms is performed during inter-arrival 
planning. 

We define that a wavelength channel (or a set of wavelength channels) is 
(are) critical to an S-D pair if the occupancy of the wavelength channel (or 
the set of wavelength channels) will block any of the future connection 
requests arriving at this S-D pair. After the completion of settling a network 
event, probing packets are sent by source node along all the APs to all its 
destination nodes defined in its routing table for gathering wavelength 
availability information. After each destination node derives all the per- 
wavelength link-state along its APs, it behaves as a coordinator and 
examines whether or not critical chaimel(s) exist between the source and 
destination nodes. The algorithm to derive the critical channel(s) between a 
source node (i.e., node i) and one of its destination nodes (i.e., node f) 
according to the gathered link-state is presented in the following pseudo- 
code, which is invoked whenever an inter-arrival planning process is 
performed in the network. 



Algorithm: ACA protocol 

We assume that a link contains K independent wavelength planes, each of 
which contains F channels that are interchangeable. 

Input: Critical_Linl^^: a data structure storing the critical channels to the node j 
in the node i, which is null if there is no critical situation; Old_Critical_Linl ^^ : a 
data structure storing the critical channels of the last inter-arrival planning, : 
the threshold of the available lightpaths to the node j. A critical situation is raised if 
the number of available lightpaths to the node j is less than or equal to . 

numjCpath^^ <— 0; /* the numner of total residual capacity {or establishable 
lightpaths) */ 

Old_Critical_Linl^^ <— Critical_Linl^\' 

Critical_Linl^^^ <— null ; 




94 



For m <— 1 to ^ j Do /* for each AP between node i and j */ 

For k <— I to K Do /* for each wavelength plane */ 

{{a, b, k)] <r- Derive the bottleneck on link a-b along the mth AP on 
wavelength plane k\ 

nuntj^ ^ <r— residual capacity (number of establishable lightpaths) on 
wavelength kth along the mth AP; 

If ((a, b, k) 6E CriticalJLinl^^^) Then 
Push {(a, b, k)} to Critical_Lin]^^^\ 

End If 

num_Cpath^^^ mm _Cpath^^^ +numl^\ 

End For 
End For 

If inum __Cpath^^^ > ) Then 

/* the number of total available lightpaths is larger than the threshold, so no 
critical situation exists between node i and j */ 

If (JDldjCriticalJLinlP^ ^ null ) Then 

/* if there was critical situation announced between node i and j in the 
last inter-arrival planning*/ 

Broadcast {OldjCriticalJLinl^^\ false); 

/* release the critical situation between node i and j */ 

/* a “true” is for raising up the critical links, and a “false” is for 
releasing the critical links */ 

Critical_LinJ^^^ null; 

/* reset the stack since there is no critical situation */ 

End If 

Else If {OldjCritical_Lin1^^^ ^ CriticalJLink^^ Then 
/* Criticality exists, and an update of critical links is needed */ 

Derive the new critical links that need to be announced now, newjClink ; 
Derive the old critical links that need to be released now, oldjClink; 

/* For example, in case OldjCritical_Link includes {a-b, kl), (c-d, k2), (e-f, 
k3), and {g-h, k3), while the Critical_Link includes {a-b, kl), (g-h, k4) 
and (m-n, k5), the oldjClink contains {c-d, k2) and {e-f, k3) and the 
newjClink contains {m-n, kS) */ 

If {newjClink ^ null ) Then 
Broadcast {newjClink, numjCpath, true); 

End If 

/* announce the newly generated criticality links to all the other ingresses */ 
If {oldjClink ^ null ) Then 
Broadcast {oldjClink^ false); 

End If 
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/* announce the old criticality links that need to be released to all the other 
ingresses */ 

End If 

With the pseudo-code, node i broadcasts criticality update information to 
all the other end nodes, where the newly emerged and the outdated critical 
channel(s) are announced with specific physical locations and wavelength 
plane(s) as a lightpath is set up or tom down. The threshold of the number of 

feasible lightpaths before criticality is annoimced,r/i^'’^\ is proportional to 
p. j , which is the parameter that directly determines the probability of the 

next connection arrival. 

If the ACA protocol fails to complete before the arrival of the next 
network event, the system has no choice but use stale link-state, which 
impairs the network dynamics when conflicts between two or more resource 
reservation processes occur. A good network design should be such that 
most of the inter-arrival planning processes can be successfully performed 
before the occurrence of the stale link-state problem. 

4.2.3 RWA with Criticality Information 

Each coordinating node receives criticality information from some other 
coordinating nodes. An example of critical channels and criticality 
information dissemination is illustrated in Fig. 4.15. Two APs (i.e., API and 
AP2), each of which contains 5 wavelength channels in the kth wavelength 
plane (or there are 5 fibers in a direction), have been set up between End 1 
and End 2. Assume End 2 is the coordinating node, and = 2 is the 

threshold of criticality. The probing packets are sent by End 1 along the APs, 
and gather the per-wavelength link-state for the S-D pair (End l)-(End 2). 
Assume there are only two available lightpaths on the kth wavelength plane 
left, and all the other wavelength planes along API and AP2 are blocked. 
The critical links in this case are (A-B, C-D)'‘ in API and (F-G)'^ in AP2, 
which will be disseminated by End 2 to all the other coordinating nodes. 
Note that if another S-D pair has no other choice but to route a coimection 
request so as to block the available lightpath along API by taking one of A- 
B and C-D, the lightpath in API is blocked, and the criticality situation along 
API is called off. In other words, the criticality of (A-B)'‘ or (C-D)*' has to be 
called off in the next inter-arrival planning if (C-D)'' or (A-B)" is occupied. 
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Fig. 4.15. An example of critical channels and criticality information 
dissemination. 

We have two steps in perfonning the RWA process with criticality 
information. In the first step, the decision maker only considers all the 
feasible lightpaths that do not take any critical link of the other S-D pair. If 
in the first step the algorithm fails to find even a feasible light path, the 
second step chooses the lightpath with least interference with the residual 
capacity of the other S-D pair. 

The cost of taking a wavelength channel that is not critical is determined 
by the ratio between the number of all wavelength channels and the number 
of residual wavelength channels in the wavelength plane. The cost function 
for ranking the feasible lightpaths between the S-D pair {i,f) is as follows: 



'^m,k ~ 







1=1 



and is defined as: 



AJ) — 



if the wavelength channel is announced critical 
or occupied by a lightpath 

otherwise 



•* t,m,k 






(4.15) 



where is the total cost of the lightpath on the kth wavelength plane 

along the mth AP between node i and j, is the cost of taking the tth 
link on the kth wavelength plane along the mth AP between node i and j. 




97 



is the number of hops along the mth AP between node i and j, Fii^l 
and FC^'^l are the number of fibers and free wavelength channels on the tth 
link of the kth wavelength plane along the mth AP between node i and j. 
Kml spon{a-b) = spanl‘^l ) is the number of wavelength 

channels on the kth wavelength plane along the span a-b. The lightpath to be 
set up is selected from all the feasible lightpaths along all APs and 

wavelength planes (i.e., the lightpath with a cost min imn Cjl'P ), which 

l<k<K 

yields a global optimization in the design space of all APs and wavelength 
planes. 

If the cost function shown in Eq. (4.15) blocks the current connection 
request, we have to take the second step of the RWA algorithm. A feasible 
lightpath that causes a minimum interference with the future possible 
connection requests is chosen. The cost function of a feasible lightpath at 
this stage is shown as follows: 



roji - 

'■'mjc ~ 



E 

( r,5 )6 CriticalitySet 



num _Cpath^''^^ ’ 



(4.16) 



where num _Cpath^'’‘Hs the number of feasible lightpaths between nodes r 
and s, and CriticalSet is the set of S-D pairs whose critical lightpath(s) is(are) 
affected. Eq. (4.16) distinguishes the interference of setting up a lightpath by 
taking the threshold of criticahty (or equivalently the traffic load ^ ) 

into consideration, which is an important indication as to how “critical” the 
occupancy of the wavelength channel is to the S-D pair. The larger the value 
of , the more likely the next connection request will be launched 
between node r and s. In other words, with the same amount of residual 
available lightpaths (i.e., num_CpatH'''’‘\ a larger incurs a higher cost 

proportionally. 

4.2.4 Simulation 



The following simulation is conducted in two different networks (22- 
node and 30-node) as shown in Fig. 4.16(a) and 4.16(b). Every connection 
request is for a bidirectional lightpath on the same wavelength plane. The 
traffic load, yO, , is such that the PB is ranged from 0.01%~20%. To balance 
the tasks of sending/receiving the probing packets and the task of 
coordinating the gathered dynamic per-wavelength link-state, the node 
coordinating the link-state is defined as the end node numbered t) such that 
\v-v\<N ! 2 for any V as the other end node, where the network nodes 
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are numbered from 0 to iV- 1. With this design, every network node can be a 
node coordinating link-state and a node sending probing messages at the 
same time. 




To examine the proposed C-BAR algorithm and the ACA protocol, the 
simulation adopts the following reported RWA schemes for the purpose of 
comparison. 

1. Fixed-Paths Least Congested (FPLC) with 2 link-disjoint shortest paths 
as APs (abbreviated as “FPLC-2D”). 

2. PT*LC with all shortest paths as APs (abbreviated as “FPLC-IS”). 

3. Adaptive Dynamic Routing (abbreviated as “ADR”). 

4. ADR with a length limitation of one hop more than the minimum 
distance on the lightpath selection (abbreviated as “ADR-2S”, where only 
the shortest and the second shortest paths can be possibly selected) 
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5. ADR with a length limitation of two hops more than the minimum 
distance on the lightpath selection (abbreviated as “ADR-3S”, where only 
the shortest, second shortest and the third shortest paths can be possibly 
selected). 

6. C-B AR and Asynchronous Criticality Avoidance (ACA) protocol 
(abbreviated as “ACA & C-BAR”). APs are arranged the same as “C- 
BAR”. The criticality threshold 77t is 1 when Aj j < 7, and 2 otherwise. 

F 

For the ADR, the cost function is: C„h =lH r~> where RB\ is the 

^a,b 

number of available wavelength channels on the kth wavelength plane. 
When a connection request arrives Dijkstra’s algorithm is invoked. A final 
decision will be made by selecting the best lightpath among all the 
wavelength planes. ADR-nS is basically the ADR with a length limit. If the 
length of a derived path exceeds the minimum distance by (n-1) hops a 
blocking is announced and the path will not be taken. For the FPLC, the 
lightpath with the widest bottleneck on a wavelength plane is selected. If two 
or more candidate lightpaths have the same width of bottleneck, the one with 
the shortest distance has a higher selection priority. The FPLC is reported to 
be one of the most efficient wavelength assignment schemes under the Fixed 
Alternate Routing architecture in terms of its ability to deal with path 
selection for multi-fiber WDM wavelength-routed networks. The FPLC 
shows increasingly better performance than other wavelength assignment 
schemes (e.g., minimum Product, Most Used, or Least Used, etc) when the 
number of fibers along a link (i.e., F) is increasing. However, this simulation 
did not use FPLC to examine the cases with longer APs (i.e., paths with non- 
minimum-distance), because FPLC itself does not put any weight on the 
length of a path. As a result, the preparation of long APs may increase the 
average network resources consumed by each connection request, which 
in^airs performance. 

Blocking performance for all the devised schemes is shown in Fig. 4.17. 
The network has 16 wavelength channels in a fiber, and 5 fibers in a 
direction. The number of connection requests for a trial was 600,000, and 
data for each plot was derived by averaging the results of 4 trials. 
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(a) 



Performance Evaluation for Different RWA 
Schemes (30- node) 




Avg. Traffic Load 
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(b) 



Fig. 4.17. Comparison of different RWA schemes with the proposed schemes, ACA 
protocol with C-BAR. 

The simulation results show that the scheme “ACA & C-BAR” 
outperforms all the other schemes significantly. The reason for the 
superiority of the “ACA & C-BAR” scheme can be seen by observing Fig. 
4.18 and Fig. 4.19, in which the average link utilization (C/7) and the 
normalized average extra length (NEL) in each case are demonstrated. UT 
and NEL are defined as follows: 



F L K NE 






UT = 



f I k ne 



NE 



FLK-^i 



where F is the number of fibers along a span, L is the number of links 
(directional) in the network, K is the number of wavelength in a fiber, NE is 
the number of network events during an experimental trial, t„e is the time 
elapsed between the ne-th and the (ne+l)th network events, and <5y ^ ^ „gis a 
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binary parameter, which is 1 if the kth wavelength channel on the^if/i fiber of 
l-th links between the ne-th and the (ne+l)th network events is occupied, 
and 0 otherwise. 



NC NE 



NEL^ ’ - ** ''’J - *- 



JVC -S'. 



where NC is the number of connections launched into the network, hy is the 
hop count of the vth connection, and hy^rntn is the smallest possible hop count 
of the vth connection. 



Avg. Link Utifization vs. Avg. Traffic Load 
Between Each S-D pair (22- node) 
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Fig. 4.18. Link utilization (UT) versus traffic load in the (a) 22-node 
network and (b) 30-node network. 
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Normalized Avg. Extra Hop Count vs. Avg. 
Traffic Load Between Each S-D Pair (30-node) 
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Normalized Avg. Extra Hop Countvs. Avg. Traffic 
Load Between Each S-D Pair (22-node) 
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(b) 



Fig. 4.19. Normalized average extra length (NEL) versus traffic load in 
the (a) 22-node network and (b) 30-node network. 

The best way to improve performance is to increase link utilization (£/7) 
without paying extra costs in average length (NEL). As shown in Fig. 4.18 
and Fig. 4.19, the “ACA & C-BAR” scheme achieved best performance 
because it yielded the highest link utilization of all the schemes while using 
fewer extra network resources for setting up a connection. Fig. 4.20 
demonstrates the improvement by activating the criticality avoidance 
mechanism, which con^ared cases with and without taking the criticality 
information into account. The adoption of the criticality information 
dissemination improves performance especially when the traffic load is low, 
and the chance of blocking can be largely reduced by not taking the critical 
lightpaths of the other S-D pairs. Since NEL should be similar in each case, 
the adoption of the ACA protocol can effectively increase link utilization 
(UT) by reducing the segmentation of network resources. This improvement 
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is at the expense of extensive signaling dissemination and a longer 
computation time. 




Fig. 4.20. Comparison of cases with and without the dissemination of 
criticality information. C-BAR-22 and C-BAR-30 are the cases without 
criticality information dissemination. 



Fig. 4.21 shows the blocking probability for the connection requests upon 
each S-D pairs with different minimum distance in the 30-node network 
when the overall blocking probability is 0.2%, 2%, and 7%, respectively. It 
is clear that after using the C-BAR planning algorithm with the proposed 



methodology for designing j . the difference of blocking probability 



between the S-D pairs with different minimum distance is reduced compared 
with the case using FPLC-2D. 



4.3 Summary 



In this chapter we introduced two planning strategies, namely Weighted 
Network link-state (W-NLS) and Capacity-Balanced Alternate Routing (C- 
BAR), and two routing protocols, the Minimum Interference Routing 
Algorithm (MIRA) and Asynchronous Criticality Avoidance (ACA) protocol. 
The design objective is to construct a framework of performing routing and 
wavelength assignment in WDM wavelength-routed networks with a 
performance behavior that is both time-efficient and capacity-efficient. The 
C-BAR algorithm deploys alternate paths (APs) for each S-D pair into 
networks off-line to pursue the capacity balancing characteristic of APs in 
the networks and the physical diversity of APs across the same S-D pair. A 
design methodology for the number of APs between each S-D pair (i.e., 
with different minimum distance (i.e., is demonstrated under a 
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particular performance requirement. Analytical analysis is conducted for 
examining the proposed plaiming scheme. 





Fig. 4.21. The histogram of blocked connection requests under different 
traffic load: (a) B.P. ~ 0.2%; (b) B.P. ~ 2%; (c) B.P. ~ 7% for a comparison 
between C-BAR and FPLC-2D. In C-BAR, FPLC is adopted for path 
selection. 
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As for the ACA protocol, fully distributed link-state dissemination is 
conducted asynchronously among all nodes coordinating, in which the 
criticality information is disseminated on-demand. The critical information 
defines the critical links between an S-D pair, which is taken into 
consideration during the path selection process for minimizing interference 
from the establishment of a current coimection request upon potential 
subsequent coimection requests. Simulation is conducted in two networks 
with 22 and 30 nodes. The results show that the ACA protocol along with 
the C-BAR planning algorithm can significantly improve performance in 
blocking probability by spreading the traffic homogeneously into the 
networks through well designed APs. We argue that the better performance 
for the “C-BAR & ACA” scheme is due to a better link utilization (UT) and 
smaller normalized extra length (NEL). The simulation results also show that 
the criticality avoidance mechanism reduces blocking probability especially 
when the traffic load is light, with which the interference between the 
lightpaths set up between different S-D pairs can be m i n i mi zed. 

With the C-BAR algorithm and ACA protocol, the network path selection 
leaves most of the optimization tasks performed at the network planning 
stage, while deriving the critical links with a time-efficient algorithm for 
correlating dynamic link-state. As a result, the routing decision can be 
completed in a short period of time. The network performance can be 
improved by the load-balancing characteristic of APs and the criticality 
avoidance mechanism for reducing resource fragmentation. We conclude 
that the C-BAR planning algorithm and the ACA protocol will be able to 
construct a framework for achieving fast provisioning of lightpaths and 
efficient path selection at a single stage. 
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Questions: 

1. How can we derive the potential traffic load across each Source- 
Destination (S-D) pair? 

2. Explain how First Fit, Random Fit, and Fixed Path Least Congested 
wavelength assigiunent schemes work. If the number of alternate paths is 
larger than 1, what is the criterion to determine which route to select? 

3. Do you agree that “with more alternate path pre-planned, the 
performance is further improved”? Explain your answer. 

4. In Fig. Q.4, the potential traffic load for each S-D pair is listed in Table 
Q.4. Assume the potential traffic load of an S-D pair is evenly distributed 
to its alternate paths. Calculate the potential traffic load along each link. 

5. hi Fig. Q.4, assume every link has only a single wavelength plane. What 
is the probability that the link is entirely congested? What is the average 
link utilization for each link? 

6. With the network apparatus shown in Fig. Q.4, and the traffic distribution 
given in Table Q.4, we assume that each connection request is for a 
single lightpath of the same bandwidth. The connection requests follow 
the Poisson arrival and the function of exponential distribution for 
departure. For simplicity, we assume that each directional link can 
accommodate S lightpaths along a fiber. 

(a) Calculate the potential traffic load along each directional link. 

(b) Calculate the probability of occupancy along each directional link 
(Hint: this is an M/M/K/K lossy system)). 

(c) Calculate the blocking probability for each S-D pair (Hint: every link 
has different probability of congestion, therefore, Eq. (4.5) may not be 
feasible). 

(d) Assume the departure rate for each connection request is no longer 
unity, instead, ^ = 2, 3, 2, 4, 1, and 3 for each connection from top of 
the table. Repeat (a) and (b) (Hint: calculate the average arrival rate 
and average departure rate for each link instead of using potential 
traffic load). 
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Fig. Q.4. 
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Table Q.4. 



1. The arrangement of alternate paths for each S-D pair in Q.6 is not 
capacity-balanced. Formulate a QIP to optimize the deployment of 
alternate paths for the network described in Fig. Q.4 and Table Q.4. 

8. In Q.7, after the programming, assume we have the potential load Wa,b 
such that: 

w fc — w 

0< ■ — <20%, for all adjacent (a, &). 

w 

We also assume that the length of each alternate path is the same before 
and after the programming. What is the upper bound and lower bound on 
the blocking probability of each S-D pair? 

9. Formulate the following plaiming problem using QIP: Each S-D pair has 
only a single shortest path pre-scheduled for the path selection (this is 
also called Fixed Routing). The optimization target is to minimize the 
deviation of the potential traffic load across each directional link in the 
network. Do you expect to see the optimization results better or worse 
than the cases with multiple alternate paths between each S-D pair? 

10. In the approach for designing the number of alternate paths, the original 
point is set such that the longest-distant S-D pairs are assigned 4 link- 
disjoint APs. Why do not we make the number of alternate paths for 
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those S-D pairs larger instead of only assigning 4 alternate paths to 
improve the total performance? 

11. What is the purpose of estimating the utilization across each wavelength 
link in the network? Can we assume that the utilization is very high by 
deploying more APs between each S-D pair? 

12. ACA protocol is a simplified version of Maximum Sum (MS) [11]. With 
the MS scheme, the total residual working capacity (i.e., the number of 
total lightpaths that can be set up in the network) is calculated right after 
any connection request is settled. The lightpath that interferes with the 
least amount of working capacity is preferred. For a distributed control, 
what information of link-state does it need to be disseminated? Compare 
MS and ACA in terms of the amount of dissemination. 

13. Explain why Criticality Avoidance Routing has a much better 
performance than FPLC and ADR in some cases. Based on the fact, list 
three most important factors impairing a multi-wavelength WDM 
network. 

14. Explain UT, NEL and PO. 




Chapter 5 



ROUTING AND WAVELENGTH 
ASSIGNMENT WITH MULTI-GRANULARITY 
OXCS 



The functional architecture and basic design principles for Multi- 
Granularity OXC (MG-OXC) networks have been introduced in Chapter 2. 
This chapter provides two dynamic algorithms for solving the problem of 
routing and wavelength assignment with tunnel allocation, namely Dynamic 
Tunnel Allocation and Capacity-Balanced Static Tunnel Allocation schemes. 
In addition to the routing algorithms, we will present their performance 
evaluation in two sample networks. 

5.1 Introduction 

The advantages of using Multi-Granularity OXCs (or MG-OXCs) include 
reducing hardware cost and achieving hierarchical traffic in the optical layer. 
In the meantime, however, it also puts extra constraints on the path selection 
process. This chapter deals with the problem of routing and wavelength 
assignment in networks with MG OXCs. 

In the networks with MG-OXCs (also termed MG-OXC networks), each 
fiber along a directional link has a specific attribute: either fiber-, waveband- 
or lambda-switched. The two layers with tunnels are also called tunnel 
layers. A lightpath can traverse through the waveband- or fiber-switched 
layers if and only if a tunnel at a proper location is set up in advance. As 
mentioned in Chapter 2, a tunnel in the MG-OXC networks is a logical 
conduit containing a group of consecutive wavelengths. A logical conduit is 




110 



supported by the configuration in switching fabrics of one or several MG- 
OXCs. This apparatus is equivalent to put an extra constraint on the routing 
of lightpaths in the tunnel layers: all hghtpaths in a tunnel have to be 
switched together. 

Our design objective is to configure the switching fabrics in each tuimel 
layer so that there can be as many lightpaths coexisting in the network as 
possible. This task also refers to tunnel allocation. We can either 
dynamically allocate tuimels into the tuimel layers as lightpath setup request 
arrives according to the current link-state, or allocate all possible tunnels into 
networks before any traffic is launched (or at the network planning stage). In 
the latter, lightpaths are allocated into the networks where the tunnel layers 
are configured with pre-defined tunnels. In the former, on the other hand, 
tunnels can be newly built or tom down for a specific lightpath setup or 
release request. We name the process as RWA with tunnel allocation. Based 
on the above two principles, two strategies for performing RWA with tunnel 
allocation are presented in the rest of this chapter: Dynamic RWA with 
Tunnel Allocation (DTA), and Capacity-Balanced Static Tunnel Allocation 
(CB-STA). 

In the DTA, tunnels are real-time allocated into networks according to the 
arrival of connection requests. The dynamic selection of a lightpath needs to 
consider both the existing tunnels that can possibly be taken as well as any 
possibility to further establish a new tunnel for the lightpath. We use Fixed 
Alternate Routing with k-shortest path set between each S-D pair to search 
an eligible lightpath. The design requirements of DTA emphasize the ability 
to swiftly adapt to traffic changes by setting up and tearing down tunnels 
according to real-time traffic. 

The second scheme, CB-STA, allocates tunnels in an off-line manner 
according to the Weighted Network Link-state (W-NLS) defined in Chapter 
4, which is used for balancing the potential traffic load. After tunnels are 
allocated into the network, Dijkstra’s shortest path first algorithm is 
performed for each connection request arriving one-by-one on the network 
topology configured with the pre-scheduled tunnels. Since we can put some 
constraints on the allocation of tunnels (e.g., a length limit and load- 
balancing characteristic) so that the tunnel locations and lengths are well 
disciplined, CB-STA generally outperforms DTA. However, CB-STA adapts 
poorly to traffic variation due to the static design approach. 

We will also investigate a combination of the above two approaches 
(termed “mixed” in the following context), in which the network is deployed 
with tunnels that will never be tom down, while dynamic allocation of 
tunnels is allowed and performed during normal operation. 

The basic design principle for the control and management of MG-OXCs 
is such that the lambda-switching channels are used mainly for packet- 
switching, layer-2 switched and TDM data (which may be the accessing 
traffic), protection, or transition between two tunnels, instead of forwarding 
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traffic. Traffic bypassing through a node should be bundled into tunnels as 
much as possible. We list a brief summary on the above descriptions for the 
operation in the MG-OXC networks as follows. 

1. A certain percentage of fibers along a directional link are assigned as 
fiber-switched, waveband-switched or lambda-switched fibers. 

2. In a waveband or fiber-switched fiber, all the wavelength chaimels in a 
waveband or a fiber have to be switched together. 

3. A connection can take a tunnel (either a waveband or a fiber tunnel) if 
and only if it takes a path that traverses the ingress and egress nodes of 
the tunnel. 

4. The connection must be lambda-switched at both the source and the 
destination node of a lightpath. 

5. A lightpath may take multiple tunnels between source and destination. A 
transition (i.e., a lambda-switching) is required when a lightpath traverses 
between any two tuimels so that the other traffic can be grouped or de- 
grouped as well. 

Fig. 5.1 shows an example of a switching process of a lightpath with the 
MG-OXC architecture. There are two tunnels shown here: the first tunnel is 
a fiber tuimel from node 0 to node 4; and the second tuimel is a waveband 
tunnel from node 4 to node 6. Note that the switching type of a fiber in a 
node is determined by the selectors as shown in Fig. 2.6. The FS selector 
determines whether the input fiber is fiber-switched or waveband/lambda- 
switched. The WBS selector determines whether the input fiber is 
waveband-switched or lambda-switched. The WBS space switching box 
exchanges wavebands between input fibers. The FS space switching box 
exchanges a whole fiber of data flow between two fibers. It is clear that the 
resource of lambda-switching boxes along these nodes is consumed at only 
node 0, node 4 and node 6. 




Traffic 

leaves 



Fig. 5.1. Switching process of a lightpath which is sourced at node 0 and destined 
at node 6. 
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5.2 Dynamic Tunnel AUocation with Path Selection 

The algorithm for the implementation of DTA with path selection is 
briefed as follows: 

1. With FAR, derive a combination of feasible tuimels along an alternate 
path under inspection (including the existing tunnels and establisheable 
tuimels), such that a lightpath can be built from the source to the 
destination. 

2. Try all the feasible alternate paths and derive an optimal combination of 
tunnels along a path. First-Fit wavelength assignment scheme is adopted. 

The grooming strategy for dynamic allocation is given as follows: 

1. Priority: use an existing fiber tunnel > use an existing band tunnel > 
create a new fiber tunnel > create a new waveband tunnel > use a lambda 
switched channel. 

2. If a tunnel is registered as “long-life”, it will never be tom down 
regardless of whether or not there is a lightpath using the tunnel; 
otherwise, it is tom down whenever there is no lightpath traversing 
through it. 

To increase the utilization of fibers dedicated for tunnel layers, the link 
cost with different switching types should be such 
ih'sXcQ.arnbdd) » c(waveband) > c( fiber ) . The cost of the t-th timnel along 
the ith alternate path between the mth S-D pair is: 

CTU=Km-c{S), 

where is the hop count of the t-th tunnel along the ith alternate path, and 

S is the switching type, which can be either lambda, waveband or fiber. 
Note that a tunnel is assumed to be non-wavelength-specific with enough 
wavelength conversion capability at the ingress and egress nodes of the 
tunnel. Therefore, the cost of the ith path is: 

CPL = Xcr.!. + , 

t t 



where Hi, is the hop count of the ith alternate path between the mth S-D pair. 
The final decision is made by taking the lightpath on the first available 
wavelength plane along the ith alternate path with the smallest CP^, where i 

= 0 ~ /m, and i is the number of alternate paths for the mth S-D pair. 

Fig. 5.2 shows an example of tunnel allocation process along an alternate 
path with the source at node 0 and the destination at node 6. There are five 
fibers along a directional link where Fj = 1, F 2 = 2 and Fj = 2. The optical 
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traffic is launched on the transport layer via lambda switching at node 0. The 
only fiber available for the lightpath is the second fiber. To allocate a 
feasible combination of tuimels, consider the 4 existing tunnels (A, B, C, and 
D) and 2 establisheable tuimels (E and F). The optimal tunnel allocation for 
this particular path can be one of 2® = 64 combinations. Fortunately, some 
of the combinations may not be feasible. For example, tunnel A and B 
should not be considered at the same time because they overlap with each 
other. Another example is that the establisheable tunnel F is exclusive to the 
fiber tunnel D. After trying all the possible combinations of tunnels and 
wavelength channels, the optimal solution can be derived. In this case, the 
optimal lightpath goes from the second fiber at node 0, and joins fiber tunnel 
D, and then goes out of D at node 4 to join waveband tunnel C. 



No^ 



Node I Node2 Node3 



Node4 Nodes 



Nodes 




Lambda channels 



WB tunnels 
Fiber tunnel 



registered as unusable □“ 
re|istered as usable fiber t unn el 



registered as usable band tunnel 



Fig. 5.2. An example of dynamic tunnel allocation with FAR. 

The dynamic tunnel allocation is exponential in coirplexity since the 

number of possible combinations of tunnel allocation is proportional to 2" , 
where n is the number of feasible tunnels along an alternate path. If the 
number of fibers in a link and the size of the network are both small, then 
deriving the optimal combination of tunnels can be done by selecting the 
best among all possible solutions. However, if the network is large, the 
number of feasible tunnels residing along an alternate path can also be large, 
which may result in a huge amount of computation time. In this case, a 
heuristic is needed to derive a sub-optimal solution so that the computation 
efficiency is improved. For this purpose, we propose a greedy heuristic that 
keeps the longest and most valuable tunnels first and discards other tunnels 
that conflict with the selected tunnels. The selection of the t-th tunnel along 
the ith alternate path between the mth S-D pair is based on another cost 
function: 
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ctU = (K-KJ-<s), 

The flowchart describing the algorithm is shown in Fig. 5.3. 




Fig. 5.3. Flowchart for describing the dynamic tunnel allocation algorithm. In 
each iteration, the wavelength has to be specified because the concatenation of 
lambda-switched channels is subject to the wavelength continuity constraint. 



5.3 Capacity-Balanced Static Tunnel Allocation 

The purpose of CB-STA is to allocate tunnels into networks at the 
planning stage (or off-line). Unlike DTA, the tunnels built-in a priori will be 
marked as “long life” and never be tom down while the network is running. 
The task of allocating tunnels into tunnel layers can be sub-divided into two 
sub-tasks; 

1. Determine a tunnel ingress-egress (I-E) pair for each tuimel based on W- 
NLS. 

2. Route for each tuimel I-E pair according to the information in the first 
task. 
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For the first task, we define the potential and sink of a node, P„, and S/„, to 
quantify the tendency for the nth node to initiate and terminate lightpaths: 

= E ^n.r M all « 

ren 

Sin = E ^r.n M all n 

ren 



where w„_r is the W-NLS between node n and node r, and n is defined as the 
node set containing all the nodes adjacent to node n. The node with the 
largest potential has the highest priority to be assigned as ingress of a tunnel; 
and the node with the largest sink has the highest priority to be assigned as 
egress of a tunnel. After a tunnel is set up between an I-E pair, an adjustment 
is made in such a way that the potential of the ingress and its neighbor nodes 

are decreased by d(5)and.^l^, and the sink of the egress and its neighbor 

nd 

nodes, are decreased by <5(S) and jn above description, the 

nd 

deviation (5(5) is a function of the switching type: 
S {fiber) = 6 {waveband )xB , where B is the number of wavebands in a 



fiber, and nd is the nodal degree of the ingress or egress node of a tunnel. 

We impose another constraint on the hop count of the tunnels to be the 
smallest integer that is larger than the average distance between each S-D 
pair in the network. The number of the tunnel I-E pairs generated at the 
beginnin g of the planning process is the maximum number of tunnels that 

the network can accommodate. For example, for a network with IlI 



directional links where L is the set of all directional links, the upper limit on 
the number of tunnels in switching layer 5 are 



NO^per{S) = 



|L|-SL(5)-ArF(5) 

DT 



where 5L(5) is the number of sub-layers for 



a layer of switching type 5, DT is the length of a tunnel in hop count, and 
NF{S) is the number of fibers dedicated to the switching layer 5. If there are 
Fi fibers for fiber-switching and F 2 fibers for waveband-switching, we get 
SUfiber) = 1 and SUwaveband) = B-, NFifiber) = F/ and NF{waveband) = F 2 . 
After the algorithm generates all fiber-tunnels, the values of potential and 
sink for each node (i.e., F„ and 5/„) are recorded and used for the generation 
of waveband-tunnels. Given the above, the deviation of(5(5)is customized 



as S{S) = ^ ^ where P and SI are 

{fiber) + {waveband) jB 

the average values of potential and sink at the beginning of tunnel allocation. 
Since each sub-layer is independent from the others for waveband tunnels. 
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the upper bound on the number of tunnels to be inspected is 



K 

DT 






The completion of the first task generates a series of I-E pairs sorted 
according to their importance to network load-balancing (we term the set of 
I-E pairs as “series” in the following context). The series of I-E pairs may 
overload the network since the number of tunnels in the series is an upper 
limit by estimation of NOupper- The problem can be formulated into an 
Integer Programming (InP) process in which the algorithm allocates as many 
tunnels as possible into the network. We assume that every node pair is 
prepared with all shortest paths to every other node. 



Target: maximize T (number of I-E pairs stored in the series), 



Subject to the following constraints: 

for t = \~T (5.1) 

for t = + (5.2) 

T ^ 

for ( 5 . 3 ) 

t=\ i=l 




where T is the number of I-E pairs from the top of the series to be allocated; 
NSt is the number of the shortest paths prepared for the t-th I-E pair in the 
series; is a binary decision index, which yields a “1” if the sth shortest 
path of the t-th I-E pair is taken, and a “0” otherwise; and is a binary 

decision index, which yields a “1” if the sth shortest path of the t-th I-E pair 
traverses span /, and a “0” otherwise. In the above formulation, Eq. (5.1) 
puts a constraint on the number of shortest paths taken by an I-E pair to be 1. 
Note that there may be more than one tuimel established between a node pair, 
and therefore the number I-E pairs dedicated to the same node pair may be 
larger than 1. Eq. (5.2) states that the I-E pairs in the series with a sequence 
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number larger than T will not be assigned a tunnel. Eq. (5.3) puts a constraint 
on the capacity along each directional link. 

The above formulation is subject to an NP-hard computation complexity. 
A heuristic approach is needed to trade performance with computation 
complexity, where each I-E pair is sequentially allocated with a tunnel 
instead of being optimized as a whole. The FF scheme [1] is adopted by 
assigning the first available sub-layer of the switching layer to a tunnel. 
Unlike the InP formulation, the algorithm skips any blocked I-E pair, and 
routes for the next I-E pair in the series. The algorithm continues until all the 
I-E pairs are tried. The computation complexity becomes 

w 

• (Fj + 5 • F 2 ) • iV ) in perfomung Dijkstra’s algorithm. 

After the network is allocated with tunnels, a make-up process is 
conducted to fill each fiber-switching and waveband-switching layer with 
tunnels as much as possible. An inspection is then made at a node along all 
the shortest paths to every other node to see if there is any possibility of 
more tunnels being set up. The inspection must follow the sequence of I-E 
pairs according to the value of F„ and 5/„ (node-pairs with larger values of P„ 
+ Sin are prioritized). The database for keeping P„ and SI„ is updated 
whenever a make-up tunnel is allocated. The make-up mechanism aims to 
increase link utilization in the fiber- and waveband-switching layers. Note 
that the links that are not traversed by any tunnel in the two layers are 
completely wasted. The make-up process can be iterated through all the 
network nodes of each layer until no tunnel can be built, or when 100% of 
tunnel coverage is achieved. The flowchart of the whole algorithm is shown 
in Fig. 5.4. 

The cost matrix CM_t that takes tunnels into consideration can be derived 
with the following pseudo-code: 

Input: CM : the cost matrix without considering tunnels; Tu : all the fiber- 
and waveband-switched tunnels. \Tu\: the number of elements in Tu. CF, 
CB and CL; the unit cost for a fiber-, waveband-, and lambda-switched 
channel, respectively, which are defined here as 1, fi and K. 



CMJ<-CM-, 



For i <— 1 to A Do 
For r <— 1 to |Tm| Do 

If (node i is the ingress of the tiuinel Tu[t] with an egress node e) 

Then 

If (CM\i][e] = 00 ) Then 
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/* if node i and e are not adjacent so that there can be set up a 
tunnel; */ 

If (Tu[t] is a fiber tunnel) 
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Then 

CM _t[i][e] < '^LESjHi,e)-l) + CL 

B ■ Num(Tu[t],e, fiber) • + Num(Tu[t], e, waveband ) • b^^ 

I* for a fiber tunnel */ 



Else 

CM _t[i][e] i Cg( / engt/iO,e)-l) + CL 

B • Num(Tu[t], e, fiber) • + Num(Tu[t], e, waveband ) • 6,. ^ 

/* for a waveband tunnel */ 

End If; 

End If; 

End For; 

End For; 

Retnrn CM_t ; 

Output: The reconfigured cost matrix M that considers tunnels. 

In the above, the function length(i,e) and Num(Tu[t], i, e, S) returns the 
minimum distance and the number of tunnels with a switching type S from 
node i to node e, respectively. The symbols f ^ and represent the 

number of free wavelength channels in the fiber, and the waveband tunnels 
connecting between i and e, respectively. These are dynamic link metrics 
and need to be updated whenever any lightpath is built-up or tom down. Fig. 

5.5 is an example showing a logical connection of tunnels between two 
nodes. The cost between the two nodes CM_t[A][B] is a parallel effect of all 
the fiber and waveband tunnels between them, which is d)mamically 
changed with traffic distribution. 




Fig. 5.5. An example illustrating the cost between two nodes connected by tunnels. 
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A tunnel is brought up if and only if both of the following resources are 
available: the lambda-switching capability between the tunnel ingress to the 
second node, and any free channel along the tunnel from the second node to 
the tunnel egress. An example is shown in Fig. 5.6(a). A connection request 
for a lightpath from G to A arrives. A fiber tunnel from F to A is available to 
the connection request if the lambda-switched capacity from F to C and any 
one of the free chaimels along the fiber tuimel are available. The virtual 
topology is shown in Fig. 5.6(b), where the tunnel is expressed as an extra 
link from F to A with a much cheaper cost. Whenever the tuimel is taken, a 
lambda-switched channel from F and C is also taken and updated in the link- 
state database. If there is only a single lambda-switched channel available 
along the link F-C, both the tunnel from F to A and the lambda-switching 
capacity between F to C can be announced as available without a conflict 
since the hghtpath can never be built in such a way as (G,F,C,D,E,F,C,B,A), 
which traverses link F-C twice. 




Fig. 5.6. An example for the policy of availability announcement during the static 
tunnel allocation process; (a) physical location of a tunnel, (b) logical expression of 
the tunnel. 

After deriving the cost matrix CM_t, Dijkstra’s algorithm is performed on 
each wavelength plane for each connection request based on the modified 
network topology and link-state using the pseudo-code in this section. The 
lightpath with the least cost among all the wavelength planes will be taken. 
If two or more equally efficient lightpaths are found, the one with the 
smaller sequence number of wavelength planes is taken. 

5.4 Simulation 

We provide a performance evaluation for the proposed algorithms on the 
16-node network of Fig. 4.2 and a 22-node network shown in Fig. 4. 16(a). 
We assumed that the number of wavelengths in a fiber is 64, and the number 
of fibers along a directional link is 5. There were 8 wavebands in a 
waveband switched fiber: the to 8* lambdas were in the first waveband. 
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the 9* to 16* lambdas were in the second waveband, . . and the 57* to 64* 
lambdas were in the 8* waveband. We assumed that every node had the 
same percentage of capacity for each switching type (i.e., the switching 
capacity dedicated to a specific switching type was the same from node to 
node). 

The simulation results make it clear that ADR-5L yields the best 
performance because there is no constraint in using any fiber along a link. 
This can be observed in three ways: firstly, with the percentage of lambda 
switching increased, the blocking probability is reduced; secondly, with the 
make-up tunnels, CB-STA has gained only a small amount of performance 
improvement; and thirdly with the same amount of lambda switching 
capability, performance is impaired with more fiber switching capability. 
The “Mixed!' yields better performance especially when the percentage of 
tunnel layers is getting larger. 

Table 5.1 shows some statistics of resource utilization in each scheme. UT 
(link utilization) is defined as the average percentage of wavelength channels 
in a specific switching layer to be occupied by a lightpath. PC (percentage of 
coverage) is the average percentage of network resources in a tunnel layer 
covered by tunnels so that the links are utilizable. We also 

define iVEL = (ratio of extra length), where NC is the 

V hop^Mn'^C 

total number of connections in the network, hopv is the length of the vth 
connection, and hopy „,i„ is the minimum distance of the S-D pair for the vth 
connection. NEL measures the extra percentage of network resources 
consumed by a single connection request. A larger NEL impairs performance 
since more network resources (i.e., hops) are consumed by a single 
connection. All the data are measured and averaged at the arrival of the 
20,000*, 30,000*, 40,000*, 50,000* and 60,000* connection request. 
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Blocking Probability vs. Traffic Load 




-DTA 

^C6-STA 
CB'STA w/i 

-ADR-5L 



(0 





Fig. 5.7. Blocking probability versus traffic load with different percentages of 
switching types in the 16-node and 22-node networks. CB-STA w/o: CB-STA tunnel 
allocation without make-up process. 
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The DTA scheme is outperformed by all other schemes at all traffic load 
conditions, percentages of switching types and network topology adopted in 
this simulation. There are two reasons for the inferiority of DTA. First, the 
tunnels in the dynamic scheme are established according to the random 
arrival of coimection requests. As a result, the tunnels may be 
inappropriately long or short, and fragmentation may occur easily. The short 
tunnels (e.g., tunnels with hop count of 1) generated in DTA are hard to be 
erased because the shorter a tuimel is, the easier it can be fitted into a 
lightpath. The short tunnels may fragment network resources and prevent 
long tunnels from being established. In that event, the link utilization in the 
tunnel layers is reduced, which may impair performance. Table 5.1 shows 
that with the DTA scheme a lesser percentage of network resources are 
covered with tunnels than is the case using the other schemes. Around 93% 
of capacity in the tunnel layers is covered with tunnels when using CB-STA 
scheme, while the DTA yields an average of around 80% coverage. In other 
words, an average of 20% of network resources in the tunnel layers is totally 
wasted due to the absence of tunnels. 

Second, the CB-STA scheme allows a path with loops to be set up while 
the DTA scheme does not. An example is shown in Fig. 5.6. With DTA, the 
connection request arisen at B or C can never take the fiber tunnel. On the 
other hand, with CB-STA or Mixed, a lightpath may take a lambda-switched 
channel and tuimeled chaimel along the same physical link. In Fig. 5.6, a 
connection request from node B to node C can take a lambda-switched 
channel on link (B-A), get on the fiber tunnel (A,B,C,F) to node F, and then 
take lambda-switching chaimel on link (F-C) to node C. Therefore, the use 
of a tunneled network link-state in CB-STA and Mixed makes the utilization 
of tunnels more flexible. 

“Mixed” outperforms CB-STA due to its smaller EL and larger value of 
UT. The dynamic establishment of tunnels in “Mixed” rather than a random 
make-up in the tunnel layers by DTA facilitates the adaptation to traffic 
variation. “Mixed” marks some tunnels with well-disciplined characteristics 
(i.e., load balancing and length limitation) as “long life” while allowing 
dynamical setup/tearing down of some others, and yields the best 
performance. We also observe in Fig. 5.7 that the randomly generated make- 
up tunnels can improve performance only to a very limited extent. 
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5.5 Summary 

In this Chapter, two algorithms were introduced for solving the RWA 
problem with tunnel allocation, namely the Dynamic Tunnel Allocation with 
path selection (DTA) and Capacity-Balanced Static Tuimel Allocation (CB- 
STA). DTA allocates tunnels into networks according to dynamically arrived 
coimection requests, which can adapt to traffic variation swiftly, but may not 
guarantee that each tunnel is at a proper location and with a proper length. 
On the other hand, CB-STA achieves capacity-balancing by allocating 
“long-life’ tunnels into networks at the planning stage (or off-line) based on 
Weighted Network Link-state (W-NLS), in which tunnels have a fixed 
length at proper locations in the network. This chapter also verified an 
approach that is a combination of the above two algorithms known here as 
“Mixed”, in which networks are deployed with “long life” tunnels 
determined by CB-STA while a dynamic setup of tunnels is allowed using 
DTA. The simulation results show that with the same experimental apparatus 
and assumptions, DTA is outperformed by the other schemes due to its 
unscheduled way of allocation that may yield short or improperly positioned 
tunnels. Statistics are provided with detailed discussions for investigating the 
characteristics of each scheme. 
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Questions: 

1. Why is the performance impaired when we use MG-OXCs instead of the 
conventional lambda-switched OXCs, even if the same amount of fibers 
are on each directional link? What are the benefits to be derived in 
adopting MG-OXCs? Explain your answer. 

2. MG-OXCs can provision multi-granularity traffic in the optical domain 
through a hardware-based approach. The OXC network is hardware- 
based because each MG-OXC dedicates a certain percentage of fibers 
along a link to a specific switching type. Do you think we can provision 
multi-granularity traffic through a software-based approach? Roughly 
devise a RWA scheme for it. 
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3. What are the pros and cons of using DTA and CB-STA? Why may the 
mix of the two achieve better performance? 




Chapter 6 



PROTECTION AND RESTORATION 



This chapter presents advances in the state-of-the-art of enabling 
technology for the survivable optical Internet, in which overviews on a 
variety of proposals for protection and restoration mechanisms are provided. 
In Section 6.1, we define necessary terminology that will be useful in the rest 
of this book. The design principles and objectives for achieving network 
survivability are presented. In Section 6.2, protection and restoration 
schemes that have been reported for ring-based non-WDM optical networks 
are introduced. Topics include the Automatic Protection Switching (APS) 
and the SONET self healing ring, which have been well standardized in 
industry. In Section 6.3, protection schemes for mesh WDM networks are 
discussed, and protocols and signaling mechanisms for performing 
restoration are addressed. The Short Leap Shared Protection (SLSP) 
framework, which is designed for performing an end-to-end service 
guaranteed shared protection, is discussed in detail. SLSP framework will 
also be taken as the basis of one of the main topics in Chapter 7 and Chapter 
8 . 



6.1 Network Survivability 

Due to an exponentially increasing amount of bandwidth carried in a 
single fiber and the emergence of versatile multimedia applications for 
delay-sensitive commercial purposes with stringent quality of service 
requirements, the occurrence of a network failure (e.g., fiber cut) may cause 
serious and even fatal impairment on the service integrity to innumerable 
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end users. Therefore, network survivability and service continuity have 
become the most important issues for optical networks. The most common 
form of communication failure is an accidental disruption of communication 
conduits of fibers buried underground, which may impair huge amount of 
bandwidth in transmission. To overcome this inevitable accident situation, a 
most commonly adopted method is to pre-plan or even pre-configure 
protection resources for each working path so that the traffic influenced by 
the failure can be switched over to the spare resources to maintain service 
continuity. 

A network is considered to be survivable if it can always maintain service 
continuity to end users. We define protection as an operation performed in 
advance of a failure, which is intended to defend the network against any 
possible disruption. Protection includes failure detection and localization, 
and also any signaling mechanism to notify the existence of the failure to the 
whole network. On the other hand, restoration is defined as the operation 
conducted after any failure occurs, which recovers the affected traffic and 
reinstates service continuity. Restoration includes all signaling mechanisms 
and network reconfiguration to recover the original traffic flow from the 
failure. A working path (or primary path) is defined as a lightpath that is 
selected for transmitting data during the normal operation. A protection path 
(or spare path, secondary path) is the path used to protect a specific segment 
of working path(s). 

For achieving survivability in the optical ' Internet, network capacity is 
generally overbuilt in comparison with average traffic volumes of current 
users, in order to support fluctuations in traffic levels and to stay ahead of 
traffic growth rates. With under-utilized capacity in networks, a most widely 
recognized strategy is to find protection resources that are physically 
disjointed (or diversely routed) from the working paths, over which the data 
flow could be switched to the protection paths during any failure of network 
elements along the working paths. 

In this book, protection and restoration schemes are evaluated according 
to the following five criteria: scalability, dynamicity, class of service, 
capacity-efficiency, and restoration time. The scalability of a scheme is 
important in the sense that computation complexity of network 
protocols/algorithms should be kept low with respect to any expansion in the 
number of nodes or links or bandwidth of each directional link. A scalable 
design ensures that control protocols/algorithms can be applied to networks 
that are large in both size and capacity, and can be applicable to the 
management of networks with a higher traffic changing rate. The issues of 
dynamicity are usually accompanied with the network scalability, which 
defines the sensitivity of the control protocols/algorithms reacting to traffic 
changes. The next generation optical Internet will dynamically provision 




131 



lightpaths, where connection requests arrive at the networks one after the 
other without any prior knowledge of future arrival. In this case, the issue of 
dynamicity is getting important. 

Networks with class of service can provide protection and restoration 
mechanisms for bandwidth provisioning to the end users according to 
Service Level Agreement. We should design the Internet restoration services 
with a wide spectrum and fine granularity so that network resources can be 
allocated in a most efficient way. The capacity-efficiency is concerned with 
the efforts to make a most use of every piece of network resources. To 
improve network capacity-efficiency, the schemes must make the most use 
of all spare capacity through any optimization process or heuristic. 
Restoration time describes how fast a failure can be recovered from an 
interruption status. In general, restoration speed, capacity efficiency, and 
dynamicity are tradeoffs in the design spectrum of network protection and 
restoration schemes most of the time. 

Pre-planned restoration is the allocation of protection resources for 
working paths before any failure occurs to the network, which can guarantee 
restoration service and restoration speed to a specific working path 
compared with real-time searching schemes (i.e., alternative paths are 
searched for the working capacity affected by failure right after the failure 
occurs). Due to the fact that failure on a single network element (e.g., a fiber 
cut or node failure) may interrupt a huge amount of bandwidth in 
transmission, the pre-planning of protection paths for the working capacity 
with a high quality-of-service requirement is the most feasible way. On the 
other hand, the real-time-searching protection finds a protection path for any 
affected working path after the failure is detected, where the protection path 
should be physically disjointed from the original working path. With the 
real-time-searching scheme, since there is no pre-configured or pre-planned 
spare capacity allocated in the networks, the throughput will not be 
impaired; however, this is at an expense of taking a risk that no protection 
path can be found for a specific working path. 

In the following context, protection resources or spare capacity depicts 
wavelength channels reserved by one or several working paths along a 
protection path or backup path. Protection path or backup path depicts a 
segment of path allocated for protection purposes. When network resources 
(e.g.., wavelength channels) are reserved for protection, it cannot be used by 
any other working path except for the best-effort traffic. Note that the best- 
effort traffic will be interrupted whenever the protection resources are 
requested by the quality-of-service working paths which are subjected to 
failure. 

According to the number of working paths protected by the same 
protection resource, the protection operation can be either dedicated 
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protection or shared protection. The dedicated protection, as shown in Fig. 
6.1(a), has two paths coimecting the source and destination nodes, and each 
of the two paths carries the same copy of data. For shared protection, 
network resources along a protection path can be reserved by multiple 
working paths for the protection purpose as shown in Fig. 6.1(b). In this 
case, more restoration time is consumed by extra signaling mechanisms and 
the configuration of devices. However, the shared protection yields much 
better capacity-efficiency by sharing protection resources between different 
working paths. 




Working path 



Protection path / 



Working 




(b) 



Fig. 6.1. (a) Dedicated protection; (b) Shared protection. 

For the shared protection, depending on whether or not protection 
resources are dedicated to a specific group of physical links, they can be 
either pre-planned or pre-configured. The differences between the two are 
based on whether configiu^tion is needed for the network resources along 
the protection paths before data flow can be switched over, and whether the 
best-effort traffic flow can make use of these protection resources at normal 
operation. As an example shown in Fig. 6.1(b), node D and node D’ are 
configured right after a failure occurs on either one of the three working 
paths. Therefore, the protection resources are only pre-planned instead of 
pre-configured. 

For dedicated protection, only a very small amount of signaling efforts 
are required since the restoration process is held within the destination node. 
For shared protection, on the other hand, a suite of signaling mechanisms is 
performed after the occurrence of a failure, which is described as follows. If 
a fault occurs on a working path, its destination node in its control plane 
realizes the fault and sends a notification indicator signal to the source node 
of the working path to activate a switchover. Then, the source node 
immediately sends a wake-up packet to activate the configuration of all 
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nodes along the protection path and then switch over the whole traffic on the 
working path to the protection path. 

Shared protection takes longer restoration time than that of dedicated 
protection for configuring the optical network elements along the protection 
paths and processing the control messages in each protection node. In 
general, the restoration time Tr, is defined as: 

Tr — Tsignaling + Tconfig T^tection » 

where TdetecHm is the time taken for failure detection, Tsig„au„g is for signaling 
propagation and node processing, and Tcon/ig is for configuring the optical 
network elements along the protection path. If we consider only the impact 
of routing strategy to the network performance, the restoration time Tr is 
totally contributed by the propagation delay, and is modeled as: 



rp working protection p p 

R ^ ^detection ^config ■ 



where u is the speed of light in a specific medium. Without loss of 
generality, every link has a distance of 1 unit. Note that the signaling 
processing time in each node is neglected in the above expression; and the 
time for configuring the protection resources is independent of the length of 
the working and protection paths since the configuration process along the 
protection paths can be conducted in a pipelined manner. Therefore, the 
impact on the restoration time by the routing strategy can only be the length 
of the working and protection paths. 

6.2 Survivability of Ring-based Non- WDM Networks 

In this section, we introduce protection and restoration operations in non- 
WDM networks with Automatic Protection Switching (APS) and SONET 
Self-Healing Ring (SHR), which have been well commercialized and 
standardized by industry. Although the two schemes are mainly used in ring- 
based optical networks (such as SONET), the basic structures and signaling 
mechanisms can be simply migrated to the WDM mesh networks. The 
following paragraphs give an overview on these two protection strategies. 

6.2.1 Automatic Protection Switching 

Automatic Protection Switching (APS) is a generic expression for the 
protection and restoration mechanisms originally designed for non-WDM 
networks, in which an overhead control channel is used to communicate 
between two neighbor nodes involved in performing a protection switch. 
APS is a general term referring to any type of automatic protection, which 
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has three main architectures: 1 + 1, 1 : 1 and 1 : N APS. The difference 
between them is the way of assigning protection resources. 

1 + 1 APS can be simply demonstrated in the form shown in Fig. 6.1(a). 
Each working path has a corresponding protection path. The source node 
transmits data flow over both the working and protection paths at the same 
time. Note that the source node (i.e., node A) in Fig. 6.1(a) splits the data 
traffic into two flows, between which the destination node randomly chooses 
one of the data streams at a normal operation. When a failure occurs to any 
one of the two routes, the other route can still keep the data transmission 
continue. The 1 + 1 protection provides the fastest restoration speed and 
requires no signaling overhead. However, the spare capacity consumed is 
intolerably high in some cases (100% redundancy in capacity), and can be 
reduced for a compromise between restoration speed and capacity- 
efficiency. 

In 1 : 1 APS, every working path (or link) has a corresponding protection 
path (or link). However, unlike the case of 1 + 1 APS, a protection path (or 
link) does not transmit traffic flow during normal operation. The data traffic 
is switched to the protection path from the working path right after a failure 
occurs. In this case, the protection path can be pre-configured instead of 
being just pre-planned for increasing restoration speed (since the 
configuration time is saved). The protection path can also be pre-planned 
such that best-effort traffic can utilize the spare capacity. With this, a 
notification packet must be sent by the destination node to the source node 
before traffic can be switched over. 

In 1 : N APS, as shown in Fig. 6.1(b), a protection path is shared by N 
working paths that are physically disjointed. This arrangement can perform 
failure recovery for any one of the N working paths at a time. After the 
failure is repaired, data traffic flowing in the protection path is returned to 
the original working path so that the protection resources can be made 
available for any future working path failure. 

In general, M : N protection refers to an APS scheme in which M 
protection paths are shared among N working paths. When more than M of 
the N working paths fail at the same time, up to M working paths with 
higher priority can be recovered from the failure in order of priority, while 
the other (N - M) working paths of lower priority will have to suffer from an 
interruption of failure. 

6.2.2 SONET Self Healing Ring 

A SONET self-healing ring is designed to set up a network in ring 
architecture, or in a concatenation of rings equipped with the standard 
signaling protocols defined in SONET Interoperable Forum (SIF) and 
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Optical Intemetworidng Forum (OIF). SONET self-healing rings are 
characterized by stringent recovery time scales of 50 milliseconds from a 
failure. The restoration process performed within the recovery time includes 
detection, switching time, ring propagation delays, and re-synchronization, 
which are derived from a frame synchronization at the lowest frame speed, 
(DSl, 1.5 Mb/s) [1]. However, the tradeoff for this very fast restoration 
speed is capacity efficiency, in which more than 100% (and up to 300%) of 
capacity redundancy may be required. 

Two types of SONET Self-Healing Rings are widely used: A 
unidirectional path-switched ring (UPSR), which consists of two 
unidirectional coimter-propagating fiber rings, referred to as basic rings; and 
a two-fiber (or four-fiber) bi-directional line-switched ring (BLSR/2, or 
BLSR/4), which is comprised of two unidirectional counter-propagating 
basic rings as well [2]. 

In UPSR, the basic design is for a channel level protection in two-fiber 
rings as shown in Fig. 6.2(a). UPSR rings dedicate one fiber for working 
Time Division Multiplexing channels (TDM timeslots) and the other fiber 
for corresponding protection chaimels. The traffic is sent along both fibers to 
perform a 1+1 protection. Whenever a fault occurs on a span, a loop-back 
mechanism is activated on the nodes adjacent to the link of failure (i.e., 
between nodes 1 and 5), as shown in Fig. 6.2(b). Different rings are 
connected via bridges or connectors. To satisfy a bi-direction connection, all 
the resources along working and protection fibers are consumed, and as a 
result, the throughput is restricted to that of a single fiber. It is clear that 
UPSR rings represent simpler designs than the other types of protection, and 
do not require any notification or switchover signaling mechanisms between 
ring nodes. However, they are capacity-inefficient since they do not address 
any resource sharing spatially between working and protection paths. In 
general, more than 200%~300% capacity redundancy is taken in this 
scheme. 
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Fig. 6.2. Illustrations of the functionality for the SONET Self-Healing 
Ring networks, (a) At normal operation, (b) A fault occurs between node 1 
and node 5. 

BLSR is designed to protect fibers with two possible variants, namely 
two-fiber (BLSR/2) and four-fiber (BLSR/4) rings. BLSR/2 is designed to 
overcome the spatial reuse limitations associated with two-fiber UPSR rings 
and only provides path protection. BLSR/2 evenly divides the capacity 
timeslots within each fiber into working and protection channels with the 
same direction, and has working channels on the fiber protected by 
protection channels on the other fiber, as shown in Fig. 6.3. Therefore, bi- 
directional coimections between nodes traverse the same intermediate nodes 
but on different fibers. This allows for sharing loads away from saturated 
spans and increases the level of resource sharing, which is a major advantage 
over two-fiber UPSR rings. 

Protection channels for working chaimels are pre-assigned based upon a 
fixed odd/even numbering scheme, and in the event of a fiber cut, all 
affected timeslots are looped back in the opposite direction of the ring. This 
is commonly termed "loop-back" line/span protection and avoids any per- 
channel processing. However, loop-back protection increases the distance 
and transmission delay of the restored channels. Furthermore, since BLSR 
rings perform line switching at the switching nodes (i.e., adjacent to the 
fault), more complex active signaling functionality is required. Further 
bandwidth utilization improvements can also be made by allowing lower- 
priority traffic to traverse on idle protection spans. 

Four-fiber BLSR rings extend upon the BLSR/2 concepts by providing 
added span switching capabilities. In BLSR/4 rings, two fibers are used for 
working traffic and two for protection traffic. Working traffic can be carried 
in both directions, which minimizes spatial resource utilization for bi- 
directional coimection setups. Line protection is used when both working 
and protection fibers are cut, and looping traffic around the long-side path is 
required. If only the working fiber is cut, less disruptive switching can be 
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performed at the fiber level. Note that all chaimels subject to failure are 
switched to the corresponding protection fiber going in the same direction. 



Time slots 

Fiber 2 Fiber 1 




Working 

lime slot Comespondiiig 



Protection 
time slot 



Fig. 6.3. Two-fiber UPSR divides the capacity time slots along a fiber 
into working and protection channels. 

Both two- and four-fiber rings provide node failure recovery for pass- 
through traffic. But it is clear that the BLSR/4 ring capacity is twice that of 
the BLSR/2 ring, and that the foiu- fiber variant can handle more failures. 
Essentially, all channels on all fibers traversing the failed node are line- 
switched away from the failed node. BLSR rings, unlike UPSR rings, require 
a protection signaling mechanism. Since protection channels can be shared, 
each node must have global per-wavelength link-state, and this requires 
signaling mechanisms over both directions of the ring. The signaling is 
achieved by an Automatic Protection Switching (APS) protocol, also 
commonly termed SONET APS. This protocol uses a 4-bit node identifier 
and hence only allows up to 16 nodes per ring. Additional bits are designated 
to identify the type of function requested and the fault condition (i.e., 
chaimel state). Control nodes performing the switchover functions utilize 
frame-persistency checks to avoid premature actions and discard any invalid 
message codes. 

6.3 Survivability in WDM Mesh Networks 

The evolution of communication networks, and their ability to deal with 
new applications and increasingly bandwidth demand, is an on-going 
process. The migration of ring topology to mesh topology has multiplied 
network transmission capacity and traffic dynamicity. However, protection 
for such networks is usually more complicated than for point-to-point links 




138 



or ring networks. The problem is sourced from the use of OXCs and other 
All-Optical components, where a mesh topology along with a new network 
layer, called the optical layer or WDM layer, is newly introduced. The 
WDM layer provides a huge amount of transportation bandwidth, which 
supports different higher-layer services such as SONET connections. Gigabit 
Ethernet, and IP-switched datagram traffic. Therefore, it is necessary to 
clarify the protection mechanisms dedicated for mesh WDM networks. 

6.3.1 A Spectrum of Protection Service 

Since there is only little difference between the cases of dedicated 
protection performed within a ring of a non-WDM network and within a 
mesh WDM network, we will not further address dedicated protection in the 
following paragraphs. We will return to dedicated protection in Chapter 8. In 
the rest of this chapter, shared protection will be the focus as it can take 
advantage of mesh topology to reduce spare capacity consumption. 

In this section, we first provide a simple model for the restoration time in 
shared protection. The constraint imposed by Shared Risk Link Group 
(SRLG) and the conventional shared protection schemes (including path- 
based and link-based protection) are briefly overviewed. 

63.1.1 Shared Risk Link Group (SRLG) Constraint 

SRLG describes the relationship between different working paths. SRLG 
constraint defines whether or not the spare capacity of different protection 
paths can be shared with each other based on the SRLG relationship of their 
working paths. To explore a maximum extent of resource sharing between 
different protection paths, the SRLG constraint is an important sharing 
constraint that needs to be addressed. 

The SRLG constraint stipulates that any two or more working paths 
sharing the same risk of failure (or in the same SRLG) cannot have their 
protection paths taking the same spare capacity. The purpose of following 
the SRLG constraint is to guarantee 100% restorability after failure on any 
single link or node in the network. An example of the SRLG constraint is 
given in Fig. 6.4. Since W2 traverses the link A-B, which shares the same 
risk of a single failure as Wl, the protection path for W2 should exclude the 
possibility of using any of the protection resources taken by Wl. Otherwise, 
a failure on link A-B will result in a resource conflict between Wl and W2 
when both paths switch their traffic to the same protection channel. 
Therefore, the SRLG constraint stipulates that W2 cannot take any network 
resources along PI for protection purposes. 

It is clear that as Wl becomes longer, more working lightpaths belonging 
to the same SRLG suffer the sharing constraint. We define the SRLG 
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constraint to be relaxed if extra switch-merge node-pairs are allocated along 
a working lightpath (or divide a large SRLG into several small ones), so that 
the sharing of protection resources can be improved. 




Fig. 6.4. An example for illustrating the SRLG constraint. 

The algorithms used in correlating the SRLG relationship and the SRLG 
constraint will be introduced in Chapter 7 and Chapter 8 for the networks 
with static and dynamic traffic, respectively. Based on the correlation result 
between different working paths, optimal spare capacity allocation and 
survivable routing can be performed. 

6.3.1.2 Path-Based Shared Protection 

For path-based protection, the source node of a working path computes a 
protection path by ensuring that it is diversely routed from the working path 
and that the SRLG constraint is met. If a fault occurs to any network element 
traversed by the working path, the terminating node in its control plane 
realizes the fault and sends a notification indicator signal to the first hop 
node of the path to activate a switchover. The source then immediately sends 
a wake-up packet to activate the configuration of the nodes along the 
protection path, and then switches traffic over from the working path to the 
protection path. An example of path-based protection is shown in Fig. 6.4, 
where W1 is protected by diversely routed protection path PI. 

As discussed in the previous paragraphs, for a working path, the 
restoration time from a failure strongly depends on the total length of 
working and protection path segments that circumvent the affected network 
element. Although path-based protection requires a simple signaling 
mechanism by circumventing any failure in an end-to-end fashion, it cannot 
guarantee the recovery time for lightpaths that need to meet stringent 
requirements on service continuity. In addition, with the path-based 
protection scheme, the SRLG constraint may limit resource sharing without 
any relaxation, and as a result may impair the performance. 

6.3.1.3 Link-Based Shared Protection 

Link-based protection has originally been devised for ring-based network 
architectures such as SONET, where network planning efforts significantly 
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influence performance. The migration of link-based protection from ring- 
based networks to mesh networks has been explored extensively in [4], [5] 
and [6]. In general, link-based protection in mesh networks is defined as a 
protection mechanism that performs fault-localization during the occurrence 
of a failure, and restores interrupted services by circumventing the traffic 
from a failed link or node at the upstream neighbor node, and merges the 
traffic back to the original working path at the downstream neighbor node. 
With the above definition, to protect both the downstream neighbor link and 
node, two merge nodes have to be arranged for every node along a working 
path. An example is shown in Fig. 6.5, where two merge nodes, B and C, 
have to be arranged for node A to protect W1 along link A-B and node B. 
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Fig. 6.5. An example of link-based protection. 

Due to fault localization and a better throughput as a result of the 
relaxation of the SRLG constraint, link-based protection provides very fast 
restoration. However, the downstream neighbor node and link require 
separate protection segments, which may impair capacity efficiency by 
consuming extra protection resources. To perform link-based protection on 
both links and nodes along a working path, a new scheme is required such 
that the consumption of protection resources is reduced without losing much 
restoration speed. 

6.3.2 SLSP Framework 

This section introduces the strategy of Short Leap Shared Protection 
(SLSP) with a detailed discussion. We will show that the SLSP framework 
generalizes traditional link- and path-based shared protection, and can 
provide a wider spectrum of service with finer restoration granularity. 

The SLSP protection scheme is an end-to-end service-guaranteed shared 
protection scheme, which enhances link- and path-based shared protection to 
provide finer service granularities and higher network throughput. The main 
idea of SLSP is to sub-divide a working path into several equal-length and 
overlapped segments, each of which is assigned (by the source node) a 
protection domain after the working path is selected as shown in Fig. 6.6. 
The overlap between adjacent protection domains (or P domains in the 
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following context) is to protect node failure along a working path. A 
diversely routed protection path segment is searched for each working path 
segment in a P domain. The diameter of a P domain is defined as the hop 
count of the shortest path between the Path Switched LSR (PSL) and Path 
Merged LSR (PML) of the P domain. 

Unlike the reported survivable routing schemes, SLSP performs 
restoration process within a pre-defined P domain instead of along the whole 
path. With each working path being segmented, the restoration service can 
be guaranteed by limiting the size (or the sum of the distance of working and 
protection path segments) of P domains along the working path. Compared 
with the path-based protection, the segmentation of working paths also 
yields less computation latency during path selection by using a fully 
distributed computing process. With this, the task of end-to-end diverse 
routing is divided into several sub-tasks, each of which deals with less 
information and link-states by the PSLs of the working path. 




ProtectioD domain 2 



Fig. 6.6. SLSP protection scheme divides the working path into several 
overlapped P domains. 

The definition of SLSP generalizes the shared protection schemes, in 
which link- and path-based shared protection can be categorized as two 
extreme cases of SLSP with domain diameters of 1 and H respectively, 
where H is the hop count of the working path. Note that SLSPl (i.e., 
protection domains with a single hop as diameter) cannot perform node 
protection, and needs to co-work with SLSP2 to form an end-to-end link- 
based protection scheme as defined in the previous section. Here we focus 
on SLSP2, and take SLSP2 as an approximation of link-based shared 
protection. 

Fig. 6.6 illustrates how a lightpath under SLSP is configured and 
recovered when a fault occurs. Node A is the source node and node N is the 
termination node. The first P domain (protection domain 1) starts at node A 
and ends at node F. The second P domain (protection. domain 2) is from 
node E to node J, and the third is from node 1 to node N. In this case (A, F), 
(E, J), and (I, N) are the corresponding PSL-PML pairs for each P domain. 
Since each P domain is overlapped with its neighW P domains by a link 
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and two nodes, a single failure on any link or node along the path can be 
handled by at least one P domain. After a fault on the working path occurs, 
the PSL of the P domain where the failure occurs is notified to activate a 
traffic switchover. For example, a fault on link C-D or node C is localized by 
its downstream node D. A fault on link F-G or node F is localized by the 
downstream node G. In the former case, node C sends a Notification 
Indication Signal (NIS) to notify node A that a fault has occurred in P 
domain 1. In the latter case, node G sends an NIS to notify node E that a 
fault has occurred in P domain 2. In the event that a failure occurs to a link 
or node covered by two P domains (e.g., link E-F), the node localizing the 
fault (i.e., node F) will notify the closest upstream PSL (i.e., node E in this 
case) for performing a restoration. After receiving the NIS, the PSL (i.e. 
node A or E) immediately sends a wake-up packet to activate the 
configuration of each node along the corresponding protection path segment 
of its P domain, and then switches the traffic over the protection path. The 
adoption of a Tell-and-Go mechanism, by which the traffic flow is switched 
to the protection path without waiting for back and forth acknowledgements, 
can fiirther reduce total configuration time to a minimum. A similar idea will 
be introduced in Chapter 9 on the topic of Optical Burst Switching. 

There are three main advantages of the SLSP framework as opposed to 
the ordinary path protection schemes. First, both the notification and the 
traveling of the wake-up message are performed within a P domain, and 
therefore the restoration time can be guaranteed by adjusting the size of the 
P domains along a working path. Second, the computation complexity of 
solving protection paths is simplified as a result of the segmentation of 
working paths. The fully distributed allocation process is scalable to the 
length of working paths. Compared with path-based protection, fewer 
protection resources need to be marked as prohibited by each PSL, and thus 
the total computation complexity for correlating the SRLG constraint can be 
reduced. Third, due to the segmentation of working paths, more resomce 
sharing can be achieved by relaxing the SRLG constraint. An example is 
given as follows to show that the segmentation of working paths improves 
capacity efficiency. In Fig. 6.7(a), working paths W1 and W2 have an 
overlapped span in both of the P domains S-A-K-H-I-J-D-G-F-E-S and A-K- 
H-I-J-D-G-F-B-A for W1 and W2 respectively, so that they share the same 
risk of a single failure. To restore W1 and W2 at the same time once a failure 
has occurred on span F-G, the number of spare links prepared for W1 and 
W2 should be the sum of the bandwidth of W1 and W2 along the spans A-K- 
H-I-J-D-G. With W2 being segmented into two P domains A-K-H-I-J-D-F- 
B-A and B-C-D-G-F-B as shown in Fig. 6.7(b), the spare capacity along the 
spans A-K-H-I-J-D for W1 and W2 can be the maximum bandwidth of the 
two working paths. In other words, the segmentation of the working path W2 
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at node F saves the spare capacity required. The protection path for F-G of 
W2 can be in the newly assigned P domain B-C-D-G-F. In this example, if 
each working path has the same bandwidth, 2 spare links of the bandwidth 
are saved after the reconfiguration. 




Fig. 6.7. In (a), the spare capacity has to be the sum of the two working 
paths. In (b), the spare capacity can be shared by W1 and W2 along A-K-H- 
I-J-D by dividing W2 into two P domains. The protection path segment for 
WI and W2 is marked as a dashed-line and a dot-dashed-line respectively. 
The circles in the two graphs are the protection domains. 

From a network administrative point of view, SLSP provides tradeoffs 
between the restoration time and the amount of protection resources 
consumed. The major disadvantage incurred by using SLSP is the increase 
of signaling complexity. 

6.4 Summary 



This chapter introduced the basic ideas of performing protection and 
restoration in optical networks. Protection and restoration are two most 
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important functional elements to achieve network survivability especially in 
the event that an All-Optical transportation infrastructure is adopted. Future 
communication networks should be survivable and robust to any failure in 
the transportation plane, where a guaranteed end-to-end protection service 
along with efficient signaling for fast restoration is desired. We provided in 
this chapter an overview on different types of protection and restoration 
schemes that have been well developed or standardized. The topics include 
ring network protection such as SONET Self-Healing Ring (i.e., UPSR and 
BLSR) and Automatic Protection Switching (APS) as well as the mesh 
network protection schemes such as shared, dedicated, link-based, path- 
based protection, and Short Leap Shared Protection (SLSP). Most of the 
above topics will be further detailed in the following chapters. 
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Questions 

1. Explain the importance of scalability, dynamicity, class of service, 
provisioning speed and capacity efficiency to the design of survivable 
optical networks 

2. In your own words, define protection and restoration in optical networks. 
Describe the general approach to achieve this purpose. 

3. What is the difference between pre-designed and real-time-searching 
restoration schemes? Discuss the advantages and disadvantages of both 
schemes. 
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4. Calculate the average capacity redundancy in UPSR and BLSR (Hint: 
enumerate all the possibilities for a session to be on the ring). Does any 
resource sharing occur to UPSR and BLSR? Explain your answer. 

5. What happens if the protection and working paths in dedicated protection 
have different physical length? What should we do to cope with this 
situation? 

6. What is the main difference between shared and dedicated protection? 
Discuss their advantages, disadvantages, and the design purposes. 

7. What is 1+1, 1: 1, 1:N, and M:N protection? Give an example of each. 

8. A fraction of network is illustrated in Fig. Q.8. W1 is a working path 
protected by PI in shared protection. 




Fig. Q.8. 



(a) Explain why the bandwidth of PI has to be the same with that of W1 . 

(b) Is there any difference in restoration time between the cases where 
the fault occurs to link 1 or link 2? 

(c) Assume every link has the same physical length m. Calculate the 
restoration time when a fault occurs on link A-B. Assume that the 
node configuration time is Tconf, the node processing time is Tproc, and 
the time to localize the failure is Tiocai- Assume the notification packet 
can cut through any intermediate node with negligible amount of 
time consumed. 

(d) If the effective transmission bandwidth of W1 is B, what size of 
buffering does the end node need to have without losing the data 
integrity? 

(e) Assume W1 is protected by PI in dedicated protection, where the 
source node launches the same copy of data in both working and 
protection paths. Calculate the restoration time in this case. 

9. In Fig. 6.1(b), working path 1, working path 2 and working path 3 have 
optical flow with bandwidth OC-3, OC-3, and OC-12, respectively. What 
is the minimum bandwidth required in the protection path between D and 
D’? What is the percentage of spare capacity redundancy in this case? 

10. Unlike UPSR, BLSR/4 has each pair of adjacent nodes interconnected by 
two fibers in a direction. Therefore, BLSR/4 allows more than one 
connection sessions set up at the same time along the same ring. 
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Calculate the average ratio of redundancy (i.e., ratio between working 
capacity and protection capacity) in this case. 

11. Compare the total transmission capacity (or average and maximum 
throughput) of a SONET BLSR/4 and UPSR systems. 

12. Fig. Q.ll shows a SONET UPSR system with multiple interconnected 
rings. Assume that each node has an equal tendency to launch traffic to 
the other nodes. No blocking can possibly occur. 

(a) Calculate the average transmission length in hop count of each 
connection. 

(b) Assume for each link in the outer rings, the probability of occupancy 
is Bi, the links of the inner ring are subject to the probability of 
occupancy by B 2 . What is the average blocking probability for a 
connection request? 

(c) Assume that the system is a SONET BLSR, repeat (a) and (b). 




Fig. Q.12. 

13. What are Unk-based protection and path-based protection schemes? 
Outline the characteristics of the two shared protection schemes. 

14. What is a Shared Risk Link Group? In Fig. 6.1(a), there exists a single 
SRLG. What are the paths contained in the SRLG? 

15. Why does SLSP relax SRLG constraint? 

16. Describe the signaling mechanisms for the restoration in a P domain 
when a failme occurs. 

17. In Fig. 6.4 where a path-based protection is exemplified, do you agree 
that the location of the failure occurring to the working path will affect 
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the restoration time? Which of the failures on link 1 and link 2 can be 
recovered faster? Explain your answer. 




Chapter 7 



SPARE CAPACITY ALLOCATION 



This chapter introduces spare capacity allocation schemes performed 
either off-line for a static deployment, or on-line during a time interval 
between two consecutive network events (i.e., a connection setup or tear- 
down). As survivability becomes one of the most important issues in the 
design of control and management of WDM networks, development of 
protection and restoration schemes became critical to network applicability 
and/or revenue-making capability. As mentioned in Chapter 6, pre-planning 
or pre-configuration of spare capacity for some working paths with stringent 
quahty-of-service requirements, are the most commonly seen approaches for 
achieving this goal. However, if the allocation of spare capacity cannot be 
both capacity- and computation-efficient, the applicability of the algorithms 
may be limited, and the network cost for control and management increased. 
Therefore, we need to develop a spare capacity allocation scheme which 
meets the requirements of computation-efficiency, capacity-efficiency, class 
of service, and adaptability to traffic variation. 

This chapter is organized as follows. Section 7.1 introduces design 
principles and objectives of spare capacity allocation. Section 7.2 defines 
span-oriented spare capacity allocation schemes, which includes such topics 
as Node Cover, Pre-configured cycle (or p-cycle), and Protection Cycles (or 
Ring Cover). These schemes do not consider the relationships among 
different working lightpaths existing in networks; instead, spare capacity is 
deployed according to working capacity along each link. Section 7.3 
presents spare capacity allocation schemes that achieve resource sharing on a 
per-lightpath basis (also called path-oriented spare capacity allocation 
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schemes). This scheme yields better capacity efficiency at the expense of 
incurring a larger computation complexity. The reason the path-oriented 
approaches take more computation time is that they need to coordinate inter- 
relationships between different working lightpaths in order to determine the 
SRLG constraint. In addition, the number of working paths to be handled 
strongly determines computation complexity. The above two aspects 
differentiate between path-oriented spare capacity allocation schemes and 
span-oriented schemes. 

In Section 7.4 we present unique network planning efforts existing in 
WDM networks, which are concluded in the inter-arrival planning. In 
addition to the scheme for facilitating RWA introduced in Chapter 4, this 
chapter further defines the mechanisms in an inter-arrival planning process 
for achieving survivability. In Section 7.5, a service-guaranteed spare 
capacity re-allocation scheme called Short Leap Shared Protection with 
Spare Capacity Re-allocation (SLSP-R) is introduced. SLSP-R is based on 
the SLSP framework defined in Chapter 6, where a working lightpath is 
segmented into a number of protection domains (or P domains) with the 
same diameter. SLSP-R provides a framework for performing spare capacity 
re-allocation in a network with dynamic traffic, and is committed to the task 
of initiating a con^romise between the extent of resource sharing on spare 
capacity and computation complexity according to traffic dynamicity. In 
Section 7.6, a summary is given along with concluding remarks on each of 
the schemes introduced in this chapter. 

7.1 Principles and Objectives of Spare Capacity 
Allocation 

One of the most common forms of failure in an optical fiber 
communication system is the accidental disruption of communication 
conduits buried underground, which may impair a huge amount of 
bandwidth in transmission. The most prevalent method of overcoming such 
inevitabilities is to pre-plan or pre-configure protection resources for each 
working path so that the traffic influenced by the failure can be switched 
over to the spare resources. We have pointed out in Chapter 6 the importance 
of survivability and service continuity in an optical network, and asserted 
that not only should the restorability for all failure situations be guaranteed 
as economically as possible but also that the restoration time be below the 
upper limit stipulated in the Service Level Agreement of each connection. 

With the above concepts, it is quite intuitive to achieve survivability with 
the knowledge of all working capacity existing in the networks. The 
development of a capacity- and computation-efficient spare capacity 
allocation scheme has been a research issue of much interest for the past ten 
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years. Most of the reported studies were performed at the network planning 
stage with an NP-hard optimization process. As the network traffic becomes 
dynamic and the restoration time requirement becomes more stringent, the 
existing optimization schemes can hardly be feasible due to their long 
computation time. 

For the sake of language precision, important terms such as “failure 
propagation”, “span-oriented” and “path-oriented” should be properly 
defined. “Failure propagation” [1] is the phenomenon in which a failure 
upon a single network element (e.g., a fiber cut) may cause multiple and 
simultaneous failures at the virtual topology of higher level links. For 
example, a fiber cut may interrupt a number of working lightpaths 
connecting different node pairs. In this case, a failure on a fiber “propagates” 
simultaneously to multiple lightpaths traversing through the fiber. The 
failure propagation phenomenon should be considered only in “path- 
oriented” spare capacity schemes because they concern each lightpath. 

A “span-oriented” spare capacity allocation scheme is the design 
approach for deploying spare capacity for protecting working capacity along 
each span. In this case, working capacity along each span is considered as a 
“bulk of bandwidth" instead of distinct working lightpaths on different 
wavelength planes traversing through the span. The most common strategy 
of deploying spare capacity for each span is in a ring, where any piece of 
working capacity along the span can be protected since it is “on a ring” or 
“straddling across a ring”. The amount of spare capacity needed along a span 
is determined by the amount of working capacity on a pre-arranged ring to 
which the span belongs. Since the restoration process is performed as if the 
span is in a ring network that suffers a failure (which is similar to the 
situation of SONET self-healing ring or Automatic Protection Switching 
(APS) ring discussed in Chapter 6), the problem of failure propagation can 
be totally set aside, and the signaling latency and configuration time can be 
reduced to a minimum. 

In a mesh network, span-oriented schemes have been well recognized as 
feasible approaches with high restoration speed and low capacity efficiency 
[2,3]. The fast restoration fi'om a failure is due to the fact that the 
deployment of spare capacity along each link is dedicated to the working 
capacity along a specific physical span. In such circumstances, the spare 
capacity can be pre-configured instead of being simply pre-plaimed, which 
saves time in the propagation of signaling and the configuration of devices 
(e.g., switching fabrics) when a restoration process takes place. However, the 
uniform capacity distribution along each ring deployed in the network limits 
resource sharing, and as a result impairs the capacity efficiency. It is 
reported 5%~10% of excess spare capacity is needed in p-cycle in most of 
the cases. For an IP or MPLS network in which traffic granularity is small 
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and traffic is dynamic, the span-oriented spare capacity allocation can be 
suitably adopted due to its reduced computation complexity by ignoring 
some possibilities of resource sharing between different working paths. The 
very fast restoration mechanisms come at the expense of taking larger 
amount of spare capacity. Another important reason for the path-based 
schemes suitable for WDM networks is that the wavelength channels in 
different wavelength planes are not normally interchangeable and are also 
expensive. The span-oriented schemes may have to be subjected to much 
higher computation complexity during the optimization process in each 
wavelength plane. In addition, it may also not be able to meet the design 
requirement in capacity efficiency from the reported documents [2,3] (i.e., 
more than 100% capacity redundancy is taken). Elaborations for improving 
resource sharing with the span-oriented approach are necessary. 

As for the other end of the design spectrum, the path-oriented spare 
capacity allocation is defined as a design approach for spare capacity 
allocation that takes the relationship between different working paths into 
consideration in order to achieve the maximum extent of resource sharing. In 
a wavelength-routed optical network (especially for a long-haul network), 
the granularity of bandwidth is large, and the occupancy of a wavelength 
channel (usually OC48 or OC-192) is expensive. The only way to improve 
the capacity efficiency from the span-oriented approaches is to “break” any 
pre-designed pattern of spare capacity (e.g., a ring, tree, or a mix of both), 
which requires uniform spare capacity along each pattern. With the path- 
oriented approach, spare capacity is allocated according to where the 
working paths are, and how much spare capacity the working paths need to 
reserve. In addition, the path-oriented approach is more suitable for a 
wavelength-routed network due to the wavelength continuity constraint. 
Different wavelength planes along a span are independent and non- 
interchangeable with each other in case wavelength conversion capability is 
absent or insufficient (which will most likely be the case in the future optical 
Internet). Therefore, taking the working capacity as a “bulk of bandwidth” 
(as required in the span-oriented schemes) is no longer feasible; instead, 
global per-wavelength link-state is needed to distinguish the spare capacity 
on each wavelength plane. Thus, the advantage of computation efficiency in 
the span-oriented schemes is lost when all wavelength planes have to be 
considered separately. In the rest of this chapter, we wiU give detailed 
discussions of both categories of spare capacity allocation schemes. 

7.2 Span-Oriented Spare Capacity Allocation Schemes 

To perform 100% restorability with the span-oriented spare capacity 
allocation schemes after any failure occurs, a specific group of protection 
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paths has to recover the working capacity they are assigned. Algorithms 
have been developed to cover the working capacity of whole networks with 
the least amount of spare capacity in the shape of a ring, a tree or a mixture 
of both, to achieve 100% restorability from any single failure. Once a fault 
occurs on any span, a restoration process is initiated to switch over the traffic 
along the affected link into a pre-defined spare route. This section introduces 
a number of reported span-oriented spare capacity allocation schemes, which 
include Node Cover, Span Protection, Ring Cover and Pre-configured Cycle 
(p-cycle). 

7.2.1 Node Cover 

This sub-section introduces an approach for performing a span-oriented 
restoration called Node Cover. This scheme tries to allocate a single spare 
cycle with a uniform bandwidth covering all the network nodes, as shown in 
Fig. 7.1. When a failure occurs to a span, the traffic traversing through the 
span is switched over to the spare cycle for restoration. The capacity of the 
cycle has to be equal to, or larger than, the maximum amount of traffic flow 
in any link of the network so as to guarantee 100% restorability. 




Fig. 7.1. An example of the Node Cover spare capacity allocation 
scheme. 

Node Cover can handle any single link failure with relatively less spare 
capacity consumed than is the case for the other types of protection schemes. 
However, a fatal drawback of Node Cover is that the restoration time could 
be unacceptably long if the network is large. The Node Cover scheme can 
also use a tree pattern of spare capacity (or other ways, such as a mix of trees 
and rings) to cover all nodes. The ring type pattern is most commonly seen 
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due to the characteristics of robustness and easy implementation. An 
algorithm for deploying a ring with minimum distance in hop count 
traversing all network nodes can be seen in reference [4]. 

7.2.2 Span Protection 

We take a proposal by Herzberg and Bye [5] as exemplary of Span 
Protection schemes. Span Protection equips every span with a protection 
path to deal with any single failure along this span. The algorithm uses a 
Linear Programming (LP) formulation to find all distinct logical routes that 
are “eligible” for restoration of a failure along any span. Since this method 
was originally designed for IP networks, the number of protection paths 
corresponding to a working path can be more than 1, each of which 
contributing a portion of restoration flows to the corresponding working 
capacity when a failure occurs. Therefore, the scheme cannot be used in a 
wavelength-routed network except when an integer constraint for each 
restoration flow is asserted. This formulation needs to be modified such that 
only an integer solution is usable when it is applied to a WDM network. 
Therefore, Linear Integer Programming instead of Linear Programming is 
required. To reduce the design space, hop limits are imposed in the selection 
of protection routes for each span, which also reduces the computation 
complexity by reducing the number of eligible protection routes for a span. 
Then the algorithm assigns spare capacity for the working capacity of each 
span by selecting a set of eligible restoration routes that yield a minimum 
amount of total spare capacity. This scheme can guarantee 100% 
restorability for any single failure upon the existing working capacity in the 
network. 

The LP formulation is as follows 

Objective: minimize 

(7.1) 

;=i 

Subject to the constraints: 

P=1 



(7.2) 
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p=l 

4p>0 i = l,2,...,L, p = 1,2,.., Pi 



(7.3) 

(7.4) 



where L is the number of links, indexed j = 1,...,|l|; Zjis the maximum 
possible restoration flow through link j; Cj is the cost of a spare channel 

assigned to link j, j = 1 ,..., |l| ; is the total number of candidate 
restoration routes for link i which do not violate the hop limit; ^ is the 
restoration flow through the pth restoration route of link /, i = 1,. . |l| ; q. is 

the total restoration level required for link i, i = 1 ,..., |zj ; S/^, is a binary 
indicator which takes the value of 1 if the pth restoration route of link i uses 
link j, and 0 otherwise, and X, is the number of units of working capacity 

(or channels) through link i, i = 1, 2, |zj . 



Note that for serving in a WDM network, f.^ in the above formation has 

to be an integer to reflect the fact that there is a basic non-divisible unit of 
bandwidth in the optical layer (i.e., the bandwidth of a lightpath). Therefore, 
Eq. (7.4) should be constrained by one more condition that ^ must be an 

integer; thus the above formulation becomes Integer Linear Programming 
(ILP). The first constraint in Eq. (7.2) ensures the restoration levels required. 
Given a single failure scenario, q, is equal to 1 for all i. The second 
constraint in Eq. (7.3) stipulates that the maximum un-directed flow 
Zj should be larger than, or equal to, the restoration capacity required by any 

possible failure in the network, which is the summation of all the un-directed 
flows /( p using the link j. 

The following is an example of the above formulation at work. As shown 
in Fig. 7.2, there are 6 working flows in the sample network with 7 nodes 
and 10 spans. The algorithm first finds all the eUgible protection routes for 
every span that may be subjected to a failure. The protection routes should 
be up to a length limit in hop count, say 3 hops in this case. We have all the 
protection routes corresponding to each span listed in Table 7.1. Then, an 
optimization process through an Integer Linear Programming formulation is 
performed to derive the minimum amount of weighted spare capacity needed 
to restore any possible failure along each span. 
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Fig. 7.2. An example to show the basic idea of the ILF formulation for a 
path-oriented spare capacity allocation scheme. 



Network Link 


Protection Routes 


A-B 


A-D-C-B and A-C-B 


A-C 


Do not need 


A-D 


A-B-C-D and A-C-D 


B-E 


B-C-F-E 


B-C 


B-A-D-C andB-E-F-C 


C-D 


C-B-A-DandC-F-G-D 


C-F 


Do not need 


D-G 


D-C-F-G 


E-F 


E-B-C-F 


F-G 


F-C-D-G 



Table 7.1. The eligible protection routes for each network link with 
working traffic. The hop count limit is 3. 

7.2.3 Ring Cover 

The wide prevalence of Automatic Protection Switching (APS) has 
inspired the deconq)osition of mesh networks into multiple APS units, which 
has generated the basic idea of Ring Cover. With Ring Cover, spare capacity 
is pre-planed or pre-configured into a set of rings for protection purposes. 
The operation of each APS ring has been discussed in Chapter 6, which 
specified that nodes adjacent to failure simply switch the affected traffic 
from the corresponding working to the protection fibers upon a failure. The 
basic operation of ring cover is to group the network resources into APS 
rings and cover each link with at least two rings (one in each direction), as 
exemplified in Fig. 7.3. As any span in this topology is subjected to a failure 
(e.g., a fiber cut on the link A-B), Ring 1 or Ring 2 or both of them will be 
activated to restore the failure. Whether both of the rings are activated or not 
depends on the amount of working capacity traversing the link A-B. 
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However, if there is more working capacity traversing through link A-B than 
can be restored by the two rings, some working capacity will not be restored 
at the occurrence of the failure. Therefore, to guarantee 100% restorability, 
the logical partition of the spare capacity should be either based on the 
distribution of working capacity or with an upper limi t depending on where 
the working capacity is routed. The settings and architecture before and after 
a link failure are shown in Fig. 7.4(a) and (b). The automatic loop-back 
recovery mechanism is performed between two adjacent nodes upon a 
failure. The fibers of one ring (in both directions) are backed up by the fiber 
on the other ring. Assuming that the networks are two-edge connected by bi- 
directional links where both working and protection resources are allocated, 
the network resources can be partitioned into a set of APS rings such that 
each link belongs to exactly two rings, and all links are included in the 
partition regardless of whether or not the networks are planar or non-planar. 




Fig. 7.3. An illustration of the ring cover for a WDM mesh network. 




(a) 
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Optical switch 
I I 2-by-2 switch 



Fig. 7.4. Switch setting and architectures for APS with the physical 
topology shown in Fig. 7.2 (a) before a failure along link A-B occurs (b) 
after a failure along link A-B occurs. The solid lines are for working paths, 
and the dashed lines are for protection paths. The 2-by-2 switch can either 
be in “BAR” status at normal operation or “CROSS” status at the recovery 
operation. 

Note that the above definition of Ring Cover is not specifically limited to 
WDM networks. To apply this technique to multi-wavelength networks 
without a full wavelength conversion capability in each node, wavelength 
planes along a fiber can be split into two folds of equal size, one for working 
and the other for protection capacity in order to minimi z e transmission 
granularity. Each wavelength on fiber 1 is backed up by the same 
wavelength on fiber 2, as shown in Fig. 7.5. For a network with a larger 
traffic capacity, each fiber (or group of fibers) can be treated as either 
working or protection capacity in a direction. 
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Wavelength 

plane 



Fiber 2 



Fiber 1 




Working 

charuids 



Corresponding 



Protection 

charmed 

] 



Fig. 7.5. An illustration of APS systems with 2 fibers in each direction in 
mesh networks. 

Based on the idea of Ring Cover, a frequently used protection proposal 
for WDM networks is the Protection Cycles scheme, which adopts the 
signaling mechanisms defined in the Optical Automatic Protection 
Switching Protocol (0-APS) introduced in Chapter 6. The Protection Cycles 
scheme is an implementation of Ring Cover, and is intended to improve the 
Ring Cover scheme in its performance behavior. Algorithms were developed 
to cover each link with double cycles for both planar and non-planar 
networks [6,7] so that the 0-APS protocol [8] could be performed in each 
ring. Basically, signaling propagation mechanisms in 0-ASP have no major 
difference fi'om those in non-WDM networks; however, the signaling 
packets and primitives for handling network-wide information are in need of 
extensions. Terminology and network architectures are defined in 0-ASP for 
equipping the Protection Cycles scheme in WDM networks with a suite of 
explicit implementation techniques and specific building components. 

The scheme of Protection Cycles also suffers from the problem of 
scalability and requires a global reconfiguration in response to a local 
network variation. Although the scheme successfully defines the architecture 
of Ring Cover in WDM networks, the most serious problem with it is that 
the capacity efficiency is not acceptable due to very limited spare resource 
sharing. As a result, the redundant capacity required is usually more than 
100% of the total amount of the working capacity that is under protection 
(this is also called spare capacity redundancy). This characteristic of the 
Protection Cycles scheme has stimulated the development of more capacity- 
efficient schemes based on a similar design approach. 
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7.2.4 Pre-configured Cycle 

The Pre-configured cycle (or p-cycle) is a well-developed strategy for 
deploying spare capacity in the pattern of a ring at the network planning 
stage. Unlike the Ring Cover scheme, p-cycle allows a span to be protected 
“straddling” across a ring so as to reduce waste of segmentation of capacity 
[2,3,11]. The shape of spare capacity (or a “pattern” in the terminology of p- 
cycle) that is allocated into the network as a basic unit is pre-configured 
instead of being pre-planned, which provides very high restoration speed 
from any link failure at the expense of a poor utilization in the spare capacity 
since the best-effort traffic can hardly utilize these network resources. With 
the knowledge of all working capacity in each span, p-cycle formulates the 
problem of minimizing summation of spare capacity along each span into an 
optimization process, using Integer Linear Prograimning (ILP), in which 
different patterns are chosen from a prepared set of candidate cycles. The 
result of the optimization is the number of copies of each different cycle 
pattern, which can be 0 or any positive integer. 

Protection Cycles and some other span-oriented restoration schemes deal 
with failures along a protection ring (or an on-cycle failure). With p-cycle, 
each cycle provides restoration for both on-cycle and straddling failure. The 
straddling failure occurs on the link connecting two nodes located on a cycle, 
as shown in Fig. 7.6(c) and (d). The inclusion of straddling failure into 
protection by each cycle improves the resource sharing and makes p-cycle 
more attractive. 






Fig. 7.6. (a) A pattern of p-cycle in the network, (b) an on-cycle failure is 
restored by using the rest of the pattern, (c) and (d) a straddling failure can 
be restored by either side of the pattern. 
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As p-cycle does not allow resource sharing between different cycles, the 
relationship between different working paths does not need to be considered. 
The formulation of the optimization problem is as follows [20]: 

Target: minimize 

s 

(7.5) 

Subject to the following constraints: 

V/ = 1.2 S (7.6) 



L + 0 V/ = 1,2,..., S , (7.7) 

i=l 

where j = 1, 2, ..., S, and 

f 1 span j has a link on pattern i 
|0 otherwise 



Both nodes of span j not on pattern i 
Span J is on-cycle with the pattern i 
Span 7 is straddling with the pattern i 

For each of the parameters, nj is the number of copies of the pattern i; wj 
and Sj are the number of working and spare links on span j, respectively; rj is 
the spare links excess of those required for span j. The target fimction in Eq. 

(7.5) is to minimize the total cost for spare capacity. The constraint in Eq. 

(7.6) stipulates that the number of patterns traversing span j is equal to the 
number of units of spare capacity. The constraint in Eq. (7.7) stipulates that 
the restoration capacity for link; prepared by all the other links should be 
Wj + rj , which is larger than the working capacity on link j. 

In the above formulation, the assumption is made that all the cycles (i.e., 
all the patterns) up to a size limitation (e.g., physical distance, hop count, 
etc) in the physical topology are determined. The Integer Linear 
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Programming formulation tries to find the best arrangement of these patterns 
to minimize the amount of spare capacity required. 

Although the amount of spare capacity required to achieve 100% 
restorability can be as low as 50% of working capacity [2,3], the deployment 
of pre-configured cycles means a higher computation complexity than with 
Node Cover schemes as more design dimensions need to be considered, such 
as the number of copies of a specific cycle. The process of solving the ILP 
formulation p-cycle is notorious for its NP completeness, which can prevent 
the scheme from being used in large networks, or ones with an even slightly 
variant traffic pattern. Therefore, we need a strategy for trading capacity 
efficiency with computation complexity in implementing p-cycle. Heuristic 
algorithms were provided [11], in which different cost functions were 
developed to address either capacity efficiency, traffic capture efficiency, or 
a mixture of both, at the expense of the consumption of spare capacity in the 
networks. After a ring is allocated, the covered working links will never be 
considered in the subsequent iterations. The iterative process ends when all 
the working links are protected with enough amount of spare capacity. In 
considering both intra-cycle and inter-cycle protection, the cost function 
developed takes both capacity efficiency and traffic capture 

efficiency into account [11]. 

Inevitably, the deployment of ring-shaped patterns into mesh networks 
may still result in excess spare capacity because of uniform capacity along 
each ring. Besides, since a lightpath is an explicitly connection-oriented 
physical conduit, the working capacity along each span is no longer of much 
interest especially when the wavelength continuity constraint exists. 
Therefore, the span-based spare capacity schemes originally devised for IP 
layer protection with a connectionless transmission mode become not 
feasible any more. This observation has motivated researchers to emphasize 
the development of path-oriented spare capacity allocation schemes for 
WDM wavelength-routed networks. We will focus on this topic in the rest of 
the chapter. 

7.3 Path-oriented Spare Capacity Allocation Schemes 

As mentioned in Section 7.2.4, one of the major disadvantages of using 
the span-oriented spare capacity allocation schemes in WDM networks is 
that the wavelength continuity constraint impairs the computation efficiency. 
In an all-optical network, all network resources (i.e., wavelength channels) 
along a protection cycle have to be on the same wavelength plane, with its 
corresponding working capacity, so that the traffic can be switched over. 
With such a consideration, the optimization has to be performed on each 
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wavelength plane separately, which distorts the advantage the span-oriented 
approaches originally possessed. 

This section focuses on the spare capacity allocation scheme which takes 
the relationships between different working paths into account (also known 
as the path-oriented spare capacity allocation scheme). Traffic along each 
working lightpath influenced by a failure is switched over to the 
corresponding protection resources separately. Since each protection path 
has a corresponding working path, no excess spare capacity can be possibly 
generated (which is different from the case of the span-oriented schemes); 
thus a larger extent of resource sharing can be achieved. In addition, due to 
the fact that each working path is equipped with an end-to-end protection 
path, working and protection paths can take different wavelength planes for 
better flexibility in path selection, which also achieves a larger degree of 
resource sharing between protection paths of different wavelength planes. 
Therefore, the main purpose of using the path-oriented schemes is to 
improve network throughput. This characteristic may be a design objective 
in some situations (e.g., for a long-haul or core network) where the traffic is 
not very variant and the capacity efficiency dominates the network total cost. 
Since the relationships between different working paths are considered, the 
confutation complexity is significantly greater than that of the span-oriented 
schemes. How to find a suitable compromise between computation 
complexity and network throughput is the focus in this chapter. 

7.3.1 Correlation of Working Paths 

Path-oriented spare capacity allocation schemes are characterized by their 
requirement to investigate relationship between different working paths that 
are in an SRLG, which significantly increases the computation complexity. 
Even worse, the effort to formulate this problem (i.e., the correlation of the 
relationships between different working paths and their protection paths) is 
nonetheless linear. However, this is the only approach that can achieve an 
optimal design of spare capacity allocation. Before any further introduction 
is given to the approaches of spare capacity allocation, a systematic and 
efficient method of correlating the per-flow relationship must be presented. 

The sharing constraint between different protection paths can be 
correlated by using the matrix approach demonstrated by Y. Liu [13]. 
Working and protection paths are represented by 1 x|l| binary row vectors: 
Or = {Or.t} and br = {br,i}. The Ith element in these vectors equals 1 if the 
corresponding path traverses link /. In [13] the author defined the path-link 
incidence matrices for both working and protection paths as the collection of 
all the incidence vectors, which form |WD|x|l| matrices A = {orj} and B = 
{brj}. The spare provision matrix C = (Cy) • M • A , where M is a 
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diagonal matrix with a size of |£>|x|d| . The minimum spare capacity 
required to perform 100% restorability for any single failure is the maximum 
of each row of the matrix C. The following is an example of using the matrix 
approach to derive a minimum spare capacity along each span given the 
distribution of all working and protection paths. 

As shown in Fig. 7.7, the working paths are marked 1 to 7 and the nodes 
are marked A to G. It is important to remember that this method determines 
the minimum amount of total spare capacity along each span given the 
working and protection paths in the network, which has nothing to do with 
path selection. The physical locations of the working and protection path- 
pairs are listed in Table 7.2. Assuming that each node pair has a traffic 
demand for establishing a single lightpath with the same bandwidth, the 
traffic demand matrix is an identity matrix. Therefore, the spare provision 
matrix C =B^ -M • A = • A . Using the data in Fig. 7.7 as an example, 

the spare provision matrix C is: 

000000000 0 
0 1 0 1 2 0 0 1 1 1 

1 1 0 0 0 0 2 0 0 1 

0000100010 
1200100000 
1 1 0 0 0 0 1 0 0 1 

0010010000 
0 0 0 0 1 0 1 0 1 0 

0 0 110 1110 2 
0 0 1 0 1 1 0 0 1 0 

Thus we can derive the minimum spare capacity required along each 
span to achieve 100% restorability for all possibilities of single failure in the 
network from the maximum element of each row in matrix C (i.e., s = [0, 2, 
2, 1,2, 1, 1, 1,2, 1] for link 1~10). 
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Fig. 7.7. An example of using a spare provision matrix for deriving the 
minimum spare capacity required along each span. 
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d(B-C-D-G. or 4-8-10) 


B-E-F-G (or 2-5-9) 
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Table 7.2. The working and protection path-pairs existing in the network. 
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Fig. 7.8. The working path-link incident matrix A and the backup path- 
link incident matrix B for the data shown in Fig. 7.7, with the working and 
protection paths given in Table 7.2. 

Without using the matrix method, the spare capacity for each link can 
easily be over-requested. An example is shown in Fig. 7.9, where there are 
three working paths in the topology: path a (A-B-C-D-E), path b (A-B-D), 
and path c (B-D-E), each of which has corresponding protection paths A-F- 
G-E, A-F-G-D, and B-F-G-E, respectively. If we simply sum up the amount 
of spare capacity along the spans reserved by different working paths sharing 









166 



the same risk of single failure, the spare capacity required along span F-G is 
3 given that the three working paths have a uniform bandwidth “1”. 
However, with the matrix approach, the spare capacity required along span 
F-G for 100% restorability is only 2 (interested readers can use the matrix 
approach to verify this result). This situation happens frequently, especially 
when non-shortest paths are taken during the routing process. 




Fig. 7.9. An example demonstrating how the matrix approach can 
achieve better protection resource sharing. 

When a flow is launched into (or removed) from the network, an offset to 
the spare provision matrix C, (or C" ) is calculated by directly applying 
the matrix operation exemplified in Fig. 7.8. For example, if flow b is 
removed, the source node of flow b (i.e., node A) will have to calculate the 
matrices A’ and B’: 
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so we get C 
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The other nodes that receive C~ then can derive the most updated spare 
provision matrix by calculating C C - C" . 

The above method is redundant in re-calculating the spare provision 
matrix after any update of link-state. The second approach for deriving C' is 
through a linear operation of the matrix. We get: 

C-=BJ-A,, (7.8) 



where A, and B, are the working and backup path-link incidence vectors 
for working path r. Therefore, the above exanq)le can also use the working 
and backup path-link incidence vectors for flow b to derive C‘ : A,. = [0, 0, 
0, 0, 0, 0, 1, 0, 0, 1] andB, = [0, 0, 1, 0, 0, 1, 0, 0, 1, 0], and. 
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With the linear property of the whole expression, it is easy to see that: 



C = '^C 
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The second approach for deriving the offset spare provision matrix is 
much more computationally efficient than the &st one, and is highly 
recommended. 

7.3.2 Basis of Path-oriented Spare Capacity Allocation 

The restoration can be performed on the basis of each working lightpath, 
in which an end-to-end physically disjoint protection lightpath is sought for 
each working lightpath. There are a couple of alternative strategies in 
implementing the path-oriented spare capacity allocation with the traffic 
demand matrix given in advance. The first is to jointly allocate working and 
protection capacity [12], and is aimed at achieving a global optimal solution 
by inspecting all design space using an optimization tool such as Mixed 
Integer Programming (MIP) or Multi-Commodity Minimum Flow (MCMF). 
The optimization process is no longer linear since the derivation of the 
minimum spare capacity along a span needs to find either the maximum or 
the sum of the working capacity that uses the span for protection. This 
process cannot be solved by any ILP solver. There are some constraints 
imposed on the selection of working and protection paths, such as that the 
working path and its corresponding protection resources have to be node- 
disjoint; or the SRLG constraint, which stipulates that two working paths 
traversing through at least one common physical element cannot share the 
same protection resources. Whenever there is a rearrangement of working 
and protection capacity, these constraints are the key factors that need to be 
considered. Therefore, the computation complexity is intolerably high in 
some cases. 

To show the InP formulation for path-oriented spare capacity allocation. 
Consider a directed graph (SI, L), where N and L is the node set and link set 
for the network. The total numbers of nodes and links are N and |l| , 
respectively. For every link / e L , c, is taken as the unit spare capacity cost. 
Let WD denote the set of traffic flows. Since a node pair can only take one 
physical route to solve the traffic demand upon it, there are at most 
Ny.{N-\) physical working paths co-existing in the network if bi- 
directional point-to-point traffic flows are considered. Traffic flow r 
(re WD) requires units of traffic demands with 100% restorability for a 
set of specific failure scenarios. The path-based protection with a link- 
disjoint restoration scheme can be represented as a path-link model where 
the design variables are indexed by the protection paths for traffic flows and 
links in the network. For each re WD , let P' denote a candidate set of loop- 
less protection paths which are link-disjoint from a given working path. The 
path set P'' is pre-calculated from the network topology and the working 
path r. The node-disjoint or link-disjoint path set P" for a specific working 
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path can be derived by using the diverse routing algorithm introduced in 
Chapter 7, or a k-shortest paths ranking method such as Yen’s algorithm 
(See Appendix A). The pth path in P'" is represented by the binary path-link 
indicator s,'-’’ which gives a value of “ 1 ” if the pth path uses link /, and “ 0 ” 
otherwise. If only the case of single link failure scenario is considered, let 
WDf denote the set of traffic flows affected by the failure of link/. WDf is 
simply the set of working paths traversing the link/. Let x'^’^’be a binary 
decision variable which takes “ 1 ” if the pth path inP''is used to protect the 
working traffic of flow r. The parameter 5 , is taken as the required spare 
capacity on link I to protect against any single link failure. 

The InP formulation for the path-oriented spare capacity allocation 
scheme has an optimization objective: 

min c, -5, (7.9) 



Subject to the following constraints: 



VreWD, 

peP' 



(7.10) 



S 






r^WDf 



peP' 



(7.11) 



Si = Integer, jc'"’'’ =0 or 1 (7.12) 

The objective function in Eq. (7.9) is to minimize the total cost of spare 
capacity in the network. The constraints Eq. (7.10) and Eq. (7.12) guarantee 
a single and integral protection flow which is link-disjoint and is prepared 
for every possible failure upon any working flow. The total spare capacity 
required on link I is determined in the constraint of Eq. (7.11), which picks 
the maximum spare bandwidth of each working flow on link / under a single 
failure scenario. While searching for the optimal or near-optimal solution 
with the Integer Programming formulation outlined from Eq. (7.9) to Eq. 
( 7 . 12 ), adding a protection path and removing another at a same time is a 
basic operation, which has been defined in the matrix method introduced in 
the previous section. 
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To overcome the time-complexity problem, heuristic and sub-optimal 
approaches are necessary when dealing with large-sized networks. We hsted 
several reported schemes for solving the scalability problem, which included 
a relaxation of the integer constraint. Local Search, Simulated Annealing 
[10], and Successive Survivable Routing [13,14]. Note that all the heuristics 
and the approximating methods toward optimization are also applicable to 
the span-oriented spare capacity allocation schemes introduced in the 
previous section. However, due to the large amount of computation time 
requested by the path-oriented schemes, these approximating approaches are 
seen as being much more important and necessary. 

7.3.2.1 Relaxation of Integer Constraint: 

We can relax the constraint that stipidates x’’"’’ must be an integer as 
shown in Eq. (7.12). The results from the relaxed InP formulation can 
provide a lower bound on the optimal solution. However, with the 
relaxation, the solution yielded by non-integer can never be applicable to 
the original InP model. The physical meaning hidden behind is that a 
working path cannot have multiple protection paths, although the protection 
paths have the same total bandwidth as the working paths. In addition, even 
though the relaxation process improves the computation efficiency of the InP 
case, it still needs a relatively large amount of time in some situations. 

73.2.2 Local Search 

The target function of the above InP formulation can be rearranged as: 



Cost(x" '’ ) = min V c, • 

leL 



max 

/6M/) 



(E 

peP" peP" 




(7.13) 



Eq. (7.13) can be easily evaluated in search-based heuristics. A local 
search is a general approach to solving an optimization problem, in which 
the algorithm guarantees that the final solution is better than any of its 
“neighbor” states in the search space. In this case, the search space is 
determined by |WD| and |p"|, in which working path r will have to select a 

protection path belonging to P ' . Therefore, there are ni P'’| different 

i^WD 

solutions. Note that each working path is prepared with a collection of 
protection paths. Each combination of the selections of the protection paths 
is called a state. For the local search, a start point is set to derive a total cost 
Cl by using Eq. (7.13) (e.g., select the first candidate protection path for all 
working paths in WD at the beginning, and is termed state 1). Theii the 
searching algorithm tries a neighbor state (e.g., select the second candidate 
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protection path for the first working path and select the first candidate 
protection paths for all the other working paths in WD, and is termed state 2) 
and derives a total cost again (i.e., C2). In the same way we can derive C3 by 
selecting the first candidate protection path for all the working paths except 
the second, which is assigned with the second protection path (termed state 
3). If C2 < Cl and C2 < C3, and state 1 and 3 are the only neighbor states 
defined for state 2, we will take state 2 as the optimal state of the search and 
C2 as the minimum cost. 

The searching process can also be performed through Genetic Algorithm, 
where each state is represented by a binary code series. It can easily be seen 
that the definition of the initial state and the neighborhood of a state are 
critical to the performance in this scheme. The simple example in the 
previous paragraph has two neighbor states for state 2. Another extreme case 
is that all states are neighbors to each other, which is nothing but a global 
search since the “local minimum” of a state can only be determined after 
inspecting all the other states. 

Except in a very special situation such as when the target function has 
only a single convex by a well defined neighborhood, the local search may 
yield poor performance. Note that the calculation of spare capacity on each 
link can employ the matrix method presented in the previous section. When 
the search is moving forward on each state the backup path-link incident 
matrix B is changed, which results in a new spare provision matrix C and a 
new vector of spare capacity reservations (i.e., the term: 
max { y* (m • V Sf’’’ • in Eq. (7.12)). As a consequence, the total 

feL-[l] „r 

' peP' peP'^ 

cost is changed as well. 

7.S.2.3 Simulated Annealing: 

The main drawback of the local search algorithm is that it only obtains a 
local optimal. Simulated aimealing prevents an algorithm from getting 
trapped in a poor local optimal, and allows searching steps that worsen the 
current solution. This character is the essential difference between the local 
search and simulated annealing. The probability of the searching mechanism 
taking a worsened step depends on the amount of dis-improvement, the 
current value of the solution, and the time spent so far in exploring the state 
space. 

Eq. (7.13) is a cost function defined on a finite set of { x ’’'’’ } feasible 
solutions, which is subject to the constraint in Eq. (7.10). Every state 
M€ has a set of neighbor states Neighbor(u) c. } . We assume 

that the neighborhood relationship is symmetric (i.e., ue Neighboriy) if and 
only ifv6 Neighbor(u)). Every move in searching is an inspection on all 
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the neighbor states. Let the probability of moving from state u to state v is 
. It is clear that > 0 and ^ = 1 • Once v is selected, the 

veN(u) 

difference C(v) - C(u) is calculated. If C(v) > C(u) , then we move to state v. 

C(v)-C(u) 

If C(v) < C(m) , then we move to state v with a probability e ^ , or 

otherwise stay in state u. Here T is a positive constant commonly called a 
control temperature, which was motivated by a certain physical analogy. 

The control temperature T determines the possibility of cost increases: 
when T is small, cost increases are highly unlikely; when T is large, the 
value of C(v)-C(m) has an insignificant effect on the probability of any 

C(v)-C(u) 

particular transition, and the value of e ^ goes toward 1. As a result, it 
is very probable to move to the next state even if it has a higher cost. To get 
a better optimization result we may set a larger T and search for a larger area 
of state space at the expense of a longer computation time. To compromise 
between computation time and optimization quality, temperature can be 
varied with time according to a temperature schedule T(t). In theoretical 

c 

studies, schedules of the form T{t) = are commonly considered [10], 

logr 

which makes the temperature T(t) decreased as time elapses. Therefore, 
tolerance of cost increase for a move at the beginning of the searching 
process is high as the value of T(t) is large, and decreases as the value of T(t) 
gets small. To further control the time required for a searching process, an 
upper limi t on the number of moves without any improvement before 
stopping the searching process, b, can be imposed according to the 
application requirement. 

For a concrete example using Eq. (7.13), a move can be defined as the 
selection of a traffic demand pair uniformly from working path set WD, and 
the alternation of its current backup path for the solution of the next move. In 
research conducted by Y. Liu [13,14], in order to select a new backup path 
for a move, the backup paths in P*" prepared for working path r are sorted 
according to their hop count and indexed accordingly. The index of each 
backup path is weighted by a random variant in the range [0..1] whenever a 
move takes place. The next move for the working path r will select the 
backup path with the smallest product. With this method, shorter backup 
paths will have a larger probability of inspection. The control temperature 
decays in accordance with the relationship T„=a- T,,, , where 

0.9 <a< 0.95 . It is clear that the value of a determines how slow the value 
of T decreases. T is updated periodically after a fixed number of accepted 
moves. 
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The simulated annealing approach is reportedly much more computation 
efficient than directly solving the InP at the expense of optimality. The other 
extra cost of using this method is that the searching parameters need to be 
tuned (i.e., control temperature T, the number of acceptances for an update 
of T, and the initial value of T, etc), which is a task requiring much 
experience and some luck. The parameters may significantly affect the 
optimization quality and convergence time. 

7 J.2.4 Successive Survivable Routing 

Successive Survivable Routing (SSR) allocates spare capacity 
sequentially for each working path in the network. Although this strategy 
may easily lead to a local optimal solution, it is attractive in that computation 
complexity is easily made adaptive to optimality by using a multi-trail 
statistical approach (introduced in the following paragraph). With SSR a 
working path is allocated into networks followed by its corresponding 
protection path. The whole task is finished after all the working paths are 
processed, one after the other. It is clear that the system must keep track of 
the link-state while allocating spare capacity for each working path. The 
selection of a backup path for the current working path strongly depends on 
the current link-state in the network. Therefore, the order of working paths to 
be processed is critical to the final result. Unlike the local search, in SSR 
only a single state is inspected for each sequence of working paths. 

When using SSR to perform spare capacity allocation we can conduct 
inspections on randomly shuffled sequences of working paths. The final 
result is selected among all the derived results. As the number of trials 
increased, it is more probable that a better result can be derived. This is also 
known as the multi-trail statistical approach. For example, if there are N 
working paths co-existing in the network, the total number of permutations 
is N factorial (or N!). The algorithm will randomly select a proportion of the 
total permutations for inspection and will pick the one yielding the best 
performance. It has been reported that the SSR can achieve a very time- 
efficient spare capacity allocation with relatively better capacity-efficiency 
than the other heuristics [13]. The scheme is flexible and adaptive, which 
balances computation time with throughput by manipulating the number of 
different sequences to be inspected. Note that the cost function and the link- 
state adopted for solving the protection paths are also critical. In general, 
Dijkstra’s shortest path first algorithm can calculate the “shortest path” 
(based on the network topology with well-defined link cost), which can 
simply be the hop count, or the length of the link, or a joint consideration of 
residual capacity along a link. This topic will be further addressed in Chapter 
8, where survivable routing schemes dealing with dynamic traffic are 
detailed. 




174 



Since SSR can be extremely fast it can easily be used in networks with 
dynamic traffic. A distributed computation approach is made possible by 
decentralizing the control of each path pair to the source node of the 
corresponding flow. When a new connection request arrives, a working path 
is routed by Dijkstra’s algorithm between the source and destination. The 
corresponding protection path is derived by erasing the links of the working 
path from the network topology, and then performing Dijkstra’s algorithm 
on the reduced network topology once again. This procedure is also called a 
two-step-approach algorithm. In terms of control and management, every 
node calculates the spare provision matrix of the whole network by keeping 
track of the sourced flows (i.e., a Hpx|L| matrix is yielded, where wp is the 
number of working paths originated from the node, and |l| is the number of 
directional links in the network). Whenever there is any change to the traffic 
flows, an update to the spare provision matrix in each node, or C" , is 
broadcast by the source node. 

7.4 Inter- Arrival Planning 

This section presents Inter-Arrival Planning, which is comprised of a 
suite of control and management mechanisms performed on-line to 
coordinate the entire network and improve the network throughput. Inter- 
arrival planning is characterized as planning efforts conducted between two 
consecutive network events that may change the network link-state (which 
can either be a request for setting up or tearing down a lightpath). This 
section will introduce the issues of algorithmic implementation for a general 
inter-arrival planning. 

Inter-arrival planning is interested due to the following reasons. The 
traffic distribution is changing in a more discrete way in optical networks 
than in the IP or MPLS networks. In the latter, millions of sessions or LSPs 
may co-exist in the networks; therefore, a threshold of link-state variation is 
defined in each node. If the link-state variation is above the threshold, link- 
state dissemination is activated. In the former, a lightpath is much more 
expensive and static than an LSP, and a significant time interval between any 
two consecutive arrivals of network events can be expected (e.g., in an order 
of several seconds to minutes). Inter-arrival planning is an on-line planning 
effort intended to increase the probability of successfully setting up 
subsequent connection requests. We need a complete description on the 
service entities and mechanisms of inter-arrival planning so that a 
quantitative study can be conducted to optimize network performance by 
taking traffic dynamics into consideration. 
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The time interval between two network events is inversely proportional 
to the traffic dynamics. If the Poisson model is taken, the average inter- 
arrival time is defined as: 



rM = 



1 



aiia.j) 



(7.14) 



where Tja is a random variable representing the inter-arrival time between 
two consecutive network events, and m/ is the number of lightpaths co- 
existing in the network. Without loss of generality, we assumed that the 
departure rate of each lightpath is jt regardless of the source and destination 
of the lightpaths. The total departure rate of lightpaths existing in the 
network is determined by the number of lightpaths multiplied by the 
departure rate of a single lightpath. 

In the following paragraphs we will discuss link-state coherence, 
correlation of SRLG relationship, protection path rerouting, and spare 
capacity re-allocation with a special emphasis on the issues of scalability and 
class of service. During inter-arrival planning, coherence between link-state 
databases of each OSPF entity is performed first. The link-state coherence 
may include the tasks of criticality information coordination and 
dissemination mentioned in Chapter 4, which facilitates the criticality 
avoidance path selection process. The correlation of the SRLG relationship 
allows the networks to dynamically allocate disjointed working and 
protection path-pairs according to the current link-state, and to conduct a 
path-oriented spare capacity re-allocation process. The rerouting mechanism 
and spare capacity re-allocation can be conducted to improve the extent of 
resource sharing between different protection paths. Some other tasks for 
facilitating network control and management can also be performed if 
required, such as the derivation of bottlenecks between each S-D pair or the 
correlation of the SRLG relationship between each working path. 

7.4.1 Link-State Coherence 

Link-State coherence has been discussed in the previous chapters, which 
concluded that link-state on-demand is the most suitable scheme for the 
optical networks, in which the per-wavelength link-states are broadcast by 
each node through a minimum balanced tree to all the other nodes whenever 
a node is subjected to traffic variation. The Asjmchronous Criticality 
Avoidance (ACA) protocol (see Chapter 4) can be performed at this stage, 
where each coordinating node disseminates the criticality information update 
message on-demand. 
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With the MPLS-based control plane where distributed control is 
provided, each source node requires sufficient knowledge of topology and 
traffic distribution before a routing decision can be made. According to the 
routing scheme adopted, the link-state to enable the path selection process is 
different. In case a fully adaptive routing scheme is adopted (e.g., ADR 
introduced in Chapter 3), the source node has to gather complete topology 
information (e.g., wavelength availability of each channel) of the whole 
network. In this case, protection paths can also be calculated with the 
network-wide link-state. On the other hand, for cases with partial-adaptive 
routing schemes, such as Fixed- or Fixed-Alternate Routing, the source node 
only needs to have the link-state along associated alternate paths among 
which the diversely routed working and protection path pair is selected. The 
probing mechanism mentioned in Chapter 4 is adopted to gather the per- 
wavelength link-state along pre-defmed alternate paths. 

7.4.2 Correlation of Shared Risk Link Group 

Protection is realized in the network optical layer by pre-arranging spare 
wavelength channels for working lightpaths. Shared protection allows two or 
more working lightpaths to take the same spare channels for protection 
purposes provided the working lightpaths do not traverse any physical 
network element in common. The lightpaths traversing the same physical 
element are in the same Shared Risk Link Group (SRLG), which may be 
subjected to a service interruption after a single failure occurs upon a 
network element. SRLG constraint stipulates that two working lightpaths in 
the same SRLG cannot occupy the same spare channel for a protection. The 
purpose of following the SRLG constraint is to keep 100% survivability for 
a single failure. 

The correlation of the SRLG relationship for each working lightpath is 
performed after completion of the link-state coherence, and should be 
completed before the next network event arrives to initiate a link-state 
update. The correlation of the SRLG relationship is activated in a node in 
charge of one or several directional links (called correspondence links to the 
node). The node maintains a spare provision vector for each correspondence 
link. The spare provision vector of a link records the demand of spare 
capacity upon all links in the network from the working paths traversing 
through the li nk. If we are given a working path and required to find the best 
protection path, we can first of all gather the spare provision vectors 
maintained in each node that is traversed by the working path, and then 
derive the link-state for solving a corresponding protection path of the 
working path. Since a single failure scenario is assumed, different links can 
never belong to the same SRLG. The amount of spare capacity along a 
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specific link in the network that is prohibited from the protection path is the 
maximum of the spare capacity demand along each link traversed by its 
working path. With this information, we can easily figure out which links 
have sharable spare capacity and which are not in the network. Further and 
more detailed discussions will be provided in Chapter 8. 

An example of correlating the SRLG relationship is given in Fig. 7.10 
using the network shown in Fig. 7.10(b). In Fig. 7.10(a), Qg, C^p and 

Cpg are three spare provision vectors maintained in nodes S, E, and F, 

respectively. Qg is equal to Cgg because the two links S-E and E-F are 

traversed only by the same working path Wl. When a working path is given, 
for example, from node E to G, the prohibited spare capacity along each link 
is the maximum of the spare capacity requirement along the two links E-F 
and F-G, and is shown as follows: 

Links of the entire network 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 

SZf =[2 222000210oooo20 0] 



The infinity notation on link 11 and 12 stipulates that the protection path of 
the given working path cannot traverse the same physical links. We can 
derive the sharable spare capacity along each link by inspecting the 
difference between the total spare capacity and the prohibited spare capacity 
along a link. 



Links of the entire network 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 

Oef=^se= [ill 10001100010 O] 

Links of the entire network 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 

CpG= [2 2220002100020 O]^^^ 
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1 2 3 





Fig. 7.10. (a) Shamble spare capacity along each network link; (b) An 
example showing the backup path-link incident matrix of W1 stored in node 
S in (c). 

If a new working lightpath traverses any one of the links 10, 11, or 12 in 
the next network event, the corresponding spare provision matrix will then 
be updated after the coherence of link-states. An example is illustrated in Fig. 
7.10(c), where a new working path S-E-F along with its protection path S-A- 
B-F is allocated into the network. The spare provision vectors for links 10 
and 11 maintained by nodes S and E are subjected to an update. Since only a 
single link is kept in a spare provision vector, the update can be performed 
by summing up the demand for spare capacity on each network link by the 
working paths traversing links 10 and 11. Therefore, after the allocation of 
W3 and its protection path S-A-B-F, C^g and Cgp are updated to: 
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CsE = C ^ F = [l 1110001100010 o] 

+ ^00010001000001] 

= [111110012100101] 

If the next network event is a connection request which arrives before the 
completion of the link-state coherence, a stale link-state problem may occur 
as the call is allocated. The assun^tion can be made that the dynamic 
survivable routing and wavelength assignment process for the next 
connection request is performed based on the link-state with the correlated 
SRLG information, since the link-state coherence can be implemented in a 
time-efficient manner. The probability of failing to finish the link-state 
coherence before the arrival of the next connection request should be very 
small. As we have mentioned, the stale link-state problem may both increase 
the chance of a resource reservation conflict and damage the provisioning 
speed of lightpaths. 

7.4.3 Rerouting of Protection Paths 

A rerouting can benefit network capacity efficiency whenever a lightpath 
is disconnected from service. An example is given in Fig. 7.11, where a 
multi-fiber network is provided. The notation WP (Working path | Protection 
path) is adopted in the following discussions to denote a working and 
protection path-pair. Three lightpath segments on the same wavelength 
plane, WP, (S2-G-D3|S2-A-B-C-D2-D3), WP 2 (S1-E-F-D1|S1-H-I-J-D1), 
and WP 3 (S1-E-F-D2|S1-A-B-C-D2) are in the network. Since the working 
paths of WP 2 and WP 3 (i.e., P2 and P3) share the same risk in the links Sl-E 
and E-F, they cannot take the same spare wavelength channels along the path 
S1-A-B-C-D2. Therefore, wavelength channels along Sl-H-I-J-Dl are 
required to be reserved for P2. Now, if WP 3 is tom down, the SRLG 
constraint no longer exists between P2 and P3. As a result, the protection 
path of P2 can be reallocated to S1-A-B-C-D2-D1, which reduces the 
number of reserved spare links from 9 to 7. 

It is important to note that the protection paths that need to be rerouted 
are those that have a corresponding working path in a same SRLG with a 
disconnected working path. Therefore, an inspection of all these working 
paths in the network is needed in order to perform a rerouting. Rerouting is 
performed in a sequential manner with a pre-defined order. After the 
completion of the SRLG correlation, the algorithm solves a disjointed 
protection path for each working lightpath in the same SRLG with the 
disconnected lightpath. 
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Fig. 7.11. An example of rerouting for deriving the protection paths after 
the tear-down of a W-P path-pair. 

7.4.4 Spare Capacity Re-allocation 

Although the rerouting mechanism for protection paths can save spare 
capacity, a further improvement can be made to the spare capacity 
reallocation process by considering more path-pairs at a time. An example is 
given in Fig. 7.12. Assume there is another working and protection path-pair 
WP 4 (L-M|L-H-I-J-M) on the same wavelength plane as PI, P2 and P3. After 
the rerouting of the protection path P2, an improvement in capacity 
efficiency can be made by rerouting WP 4 , which has nothing to do with 
WP 3 . It is clear that the spare capacity reallocation needs to be performed in 
a global manner to achieve a better design. 




Fig. 7.12. An example demonstrating that the rerouting mechanism may 
leave room to improve. 

We try to further reduce the consumption of spare capacity. The 
techniques introduced in Section 7.3.2 for approximating optimal spare 
capacity allocation provide a platform for trading-off optimization quality 
for computation complexity by adjusting the control parameters. An example 
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can be seen in the implementation of the simulated aimealing, where the 
decay constant a and the number of acceptances for an update of T can 
determine the trade-off between quality of searching result and computation 
time. Another example is the statistical multi-trial technique for SSR, where 
the number of trials with different sequences can be manipulated to 
determine the trade-off between searching result quality and computation 
time. The above examples show flexibility and adaptability to different 
network environments and design requirements. 

Note that the length of time between two network events is inversely 
proportional to the traffic dynamics in the network. If we neglect the latency 
consumed by the link-state coherence, and assume that Poisson arrival of 
connection requests and an exponential distribution function of holding time 
for each lightpath are adopted, the probability of performing a successful 
spare capacity reallocation in inter-arrival plaiming is: 









, where ji and >1 , . are 



the departure and arrival rates of lightpaths, and Treaiio is the time for 
completing a reallocation process. 



Fig. 7.13 summarizes the inter-arrival planning process. Note that if the 
rerouting or spare capacity reallocation cannot be completed before the 
arrival of the next network event, the rerouting or reallocation process may 
yield an invalid answer, and the inter-arrival planning is thus interrupted. In 
this case, if the subsequent network event is a connection setup, the rerouting 
or spare capacity reallocation process must be restarted for the current newly 
established lightpaths. On the other hand, if the network event is a lightpath 
tear-down, the unfinished rerouting or spare capacity reallocation process 
will either continue with stale link-state or get discarded. 
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Fig. 7.13. The flowchart for performing inter-arrival planning. 

7.5 SLSP with Spare Capacity Re>allocation 

This section will present a particular implementation of the path-oriented 
spare capacity allocation scheme, called SLSP with Spare Capacity Re- 
allocation (SLSP-R), to deal with networks with dynamic traffic. Detailed 
analysis and discussions are provided. 

7.5.1 Introduction 

Short Leap Shared Protection (SLSP), with its basic architecture and 
signaling mechanisms, was discussed in Chapter 6. Based on the SLSP 
framework, SLSP-R is aimed at solving the problem of capacity-efficient 
spare capacity re-allocation with the InP formulation, and takes both 
computation complexity and network dynamics into consideration. With the 
SLSP framework, protection domains (P domains) are allocated along a 
working path in a cascaded manner, which facilitates both link- and node- 
protection. Due to the fact that spare resources are pre-planned instead of 
being pre-configured, the spare links can accommodate best-effort traffic at 
normal operation. An intrinsic difference between SLSP-R and the other 
span-oriented spare capacity allocation schemes is that SLSP-R considers the 
wavelength continuity constraint, in which working and protection paths are 
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required to be on the same wavelength plane. As a result, the loop-back 
mechanism is performed for traffic restoration without experiencing 
wavelength conversion. Another important characteristic of SLSP-R is that it 
is devised to deal with dynamic traffic by compromising optimization 
quality with computation efficiency. Therefore, the SLSP-R scheme is 
adaptive to network environments with different service requirements. 

We assume that the PSL-PML pairs along each working path have been 
determined off-line, and will not be changed. There are three steps for the 
implementation of the SLSP-R algorithm: 

1. Search for all the cycles in the network topology up to a limited size (i.e., 
hop count) ^fy\cy E CY }. The cycle listing algorithm can be seen in 
reference [9]. 

2. Select cycles from CY such that the cycles exactly contain the path 
segment (PSLrj - PMLr,,), where 0<i<d^, and dr is the number of P 
domains along working path r . The cycles in CYrj are termed feasible to 
the P domain i of working path r. 

3. Optimize the deplo 3 onent of cycles, which is simply formulated as an 
Integer Programming (InP) problem: 

Objective: 

minVc, -5, (7.15) 

rr ‘^1. ■" 



Subject to the following constraints: 



^ = 1 , for all i and r (7.16) 






m. 



r^WD, 



■ 



X, 



r,cy 



L for all I, r and i, / e L 



(7.17) 



Si = Integer, x!'''^ =0or 1 (7.18) 

where cj and sj are the cost and the amount of spare capacity on the span 
Ie L , respectively. Note that c, can be designed to reflect the topology 
information or residual capacity along link J. The cycle set CY^ i is pre- 
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calculated from the network topology, working path r&WD, and the pre- 
defined PSL-PML pairs along working path r. Let Ji:[’‘^be a binary decision 
variable which takes “1” if the cy-th cycle in CY^jis taken to protect the P 
domain i along working path r. The binary cycle-link indicator gives a 

value of “1” if the cy-th cycle for P domain i along working path r uses link 
I, and “0” otherwise. If the single failure scenario is considered, and WD/ 
denotes the set of traffic flows affected by the failure of link /, WD/ is simply 
determined by the set of working paths traversing link/. The working path 
segment in P domain i for working path r is defined as the path segment 
(P5L,i -PSL^j^i) for l<i<(d^ -1) or (PSL^j )> shown in Fig. 

7.14. The symbol is the scenario of restoration level for working path r. 
In general, a single failure scenario is desired and yields = 1. 



Protection domain ! Protection domain 2 Protection domain 3 




Working path segment Working path segment Working path segment 

for P dometin 1 for P domain 2 for P domain 3 



Fig. 7.14. The correspondence of a working path segment to a P domain 
along a working path. 

The goal in Eq. (7.15) is to minimize the cost of spare capacity deployed. 
The constraint in Eq. (7.16) defines the mnnber of P domains along working 
path r. The constraint in Eq. (7.17) puts a lower bound on the amount of 
spare capacity needed to achieve full restorability. Eq. (7.18) guarantees an 
integer solution. Note that in Eq. (7.16), an extra constraint is added so that 
the optimization for the P domains along a working path is performed 
sequentially on each P domain. Once an optimal cycle is found in a P 
domain, the other P domains along the working path will use this result to 
find their own best cycles. For example, if there are three P domains along a 
working path, and each of which contains 3 candidate cycles, then 9 
inspections are made for the different cycle combinations instead of 27. 

The search space in the above formulation has a con^lexity of 

is the number of feasible candidate cycles 

mWD i=l 

for P domain i of working path r, where \<i<d^. Here is an example that 
roughly demonstrates the computation complexity of performing the spare 
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capacity with each working path being fragmented. Given a 30-node mesh 
network with 5 fibers in a direction along a link, and jCF,. , | = 4 for all r and 

i, |WZ)| = 50, and = 2, the search space can be as large as (4 + 4)^® , 
which is intolerable to most of the cases. To make matters worse, the 
complexity is exponentially increased as the number of working paths |WD| 

is increased. To solve the scalability problem, heuristics is strongly solicited 
and will be introduced in the following section. In the rest of this section, we 
will present a framework for optimizing network performance by taking both 
traffic dynamicity and computation complexity into consideration. 



7.5.2 Interleaving of the Working Paths 



In order to reduce the computation complexity of using the InP 
formulation, SLSP-R interleaves the optimization process into several 
sequential sub-processes, computation complexity is reduced from 

dj. m d. 

«niicn,i) <0 o(S n 21 CF,.,.|) , where m is the munber of 

rsWD I y=l r^Sj i 

SjCWD 



subsets by the sub-grouping, and Sj is the subset of working paths, where 
'\WD\' 

1 < j<m and m = ' 



Mw 



. In other words, several subsets of working 



paths are optimized one after the other. 



It can easily be seen that with the same number of working paths, the 
larger a subset is the better performance can be achieved at an expense of 
computation time. Therefore, the purpose of interleaving the working paths 
is to compromise between computation time and optimization quality. After 
optimizing a subset of working paths, the link-state database is updated 
according to the optimization result, upon which subsequent optimization 
processes for the other subsets of working paths can be based. The following 
expressions model the computation time where a successful optimization 
process is completed before the next network event arrives: 



m d, 

'^reallo ~ ^ ’ xnzi CYj.^ + overhead 

;=1 i 

SjCWD 



rpS 



>z 



reallo 



(7.19) 
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where Treaiio is the computation time for corq)leting an optimization process 
of all the subsets, Tg is the time interval between the s-th and the (s+l)-th 
network event, which is mainly determined by the number of working path 
in the network, and a is a constant associated with the computational 
capability of the system. For simplicity, the following discussions on 
performance modeling will be based on the assumption that the number of 
feasible cycles |cr,. i| = Y for all r and i, and d^=d for all r, and |5^| = Mw 
for all j, where Mw is the total number of working paths subject to traffic 
variation. Overhead is independent of network size and is ignored. From Eq. 
(7.19) we get Eq. (7.20): 



T = 

reallo 



Mw 



'^E ^ '^reallo 



(7.20) 



Note that cl • y is the average number of feasible cycles to be examined 
for a working path. Because the increase of Mw can increase the extent of 
resource sharing in each spare capacity reallocation while reducing the 
probability of completing an optimization process before the next network 
event arrives, an optimization for overall performance can be made by 
manipulating the number of working paths in a subset. Based on the arrival 
rate of network events and computing power, it is expected that there exists 
an Mwopt which gives the best performance. We will formulate this statement 
in Section 7.5.5 and verify it by simulation in Section 7.6. 

Before going any further, one may ask about the criteria for grouping the 
existing working paths into a subset. This section examines three strategies 
of grouping working paths in terms of physical distribution: 

1. Most Overlapped (MO): working paths with the maximum extent of 
overlapping are grouped together. 

2. Most Diverse (MD): working paths with the minimum extent of diversity 
are grouped together. 

3. Randomly Distributed (RD): working paths are grouped randomly. 

To go through each of the design approach, we introduce in the following 
paragraphs a suite of greedy algorithms, hi addition to the physical 
distribution of the working paths, the algorithms are required to contain the 
same (or a similar) number of working paths in a subset so that the time for 
completing optimization for each subset can be as close as possible. In other 

words, the value of (d in Eq. (7.20) for each subset should be roughly 

the same. The symbol (<i • F)^”" is defined as the mass of a subset. 

For presenting the algorithm, an index, , is defined to quantify the 
amount of overlapping among a set of working paths, q, as 
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follows: A = , where Cvgq is the total coverage by the path set q, and 

j 

Hj is the hop count of the path j in the path set q. An example is illustrated in 
Fig. 7.15. Path set q has three paths, PI (A-C-F-G), P2 (B.-C-D-G) and P3 
(E-F-G-H), with a hop sum (or a mileage) of 9. The total coverage of g is 8 
(i.e., A-C, B-C, C-D, D-G, G-H, F-G, E-F, C-F). Therefore, A^ equals to 

8/9 or 0.889 in this case. 



P 5 P P 



P 



B P2 



H-.E P3 * 

P s 



^ -G° i" 



-di 



Fig. 7.15. An example for demonstrating the index, ^,for quantifying 
the extent of overlapping among paths. 



Algorithm 1: (MO) 

Input: T: the upper bound on the mass of a subset; WD: all existing working 
paths; 

WD^g^ouped ^ ungrouped at the beginning; */ 

j^U I* index of subset; */ 

While^^^ wheuever any path is ungrouped, iteration 

continues; */ 

Ao 4- find index of coverage of ; 



Sj 4— ; I* Sy jth subset of working paths, which has all paths in 

at the beginning; */ 

EX[ ] 4— true ; 

/* initialize a Boolean array with \wD ^ I elements to identify the status of 

•' j r r ^ mgrouped \ •' 



each working path in ; */ 

/* a round to determine subset j starts; */ 
While® (majj of 5^.>7)Do 



/ <— 0; /* index for keeping the most unwanted path; */ 



For {i 1 to WD. 



mgrouped] 



) 



If (EX[i] == true) Then 

EX[i] <— false ; /* path i is taken away temporarily; */ 
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A EX <— find coverage of paths with EX[ ] == true in ; 

>Aitd) Then ~ 

/* If the path excluded in this round is the relatively most “diverse” of all paths 
with EX[ ] = true, it will be taken away; */ 

^ ^EX ’ 

/* to this point, record the maximum value of coverage we can achieve by taking 
away one path; */ 

I i — i; 

/* record the index of the path that is the most diverse inspected so far; */ 

End If 



EXli] <- true; /* Change the status of path i back; */ 

End If 
End For 

If (7 = 0) Exit(O); End If; 

/* “7” can never be 0 since there is bound to be a path that can increase the 
average value of coverage; if so, program ends abnormally; */ 

EX[I\ <- false; 

Delete path 7 from Sj\ 

/* path 7 is kicked out since it is the most diverse path, which yields the largest 
coverage increase when taken away; EX[I\ is set to false so that path 7 will 
never be considered in this round */ 

End While® 

/* a round ends; Sj is generated as a new subset; */ 

Delete 5,. from ; 

y y + 1 ; /* subset y is newly formed; the rest of the paths in 

WDmgrouped Considered in the subsequent rounds; */ 

EndWhile^*^ 

Return S,y; 

Output: S: an array of subsets of working paths that will be optimized together; 
y: number of subsets. 

Algorithm 2: (MD) 

The only difference between Algorithm 1 (MO) and Algorithm 2 (MD) is 
the part located in the rectangle, which is rewritten to: 
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IflAgx <A^o) Then 

/* If the path excluded in this round is the relatively most “overlapped” among 
all the paths with EX[ ] == true, it will be taken away; */ 

^ ^EX ' 

/* To this point, record the minimum value of coverage we can achieve by 
taking away one path; */ 

/ < — /; 

/* record the index of the path that is the most diverse inspected so far; */ 

End If 

Algorithm 3 : (RD) 

^^mgrouped ^ ungTOuped at the beginning; */ 

j <— I', I* index of subset; */ 

While^*^ ( I > 0 ) Do /* whenever any path is ungrouped, iteration 
continues; */ 

Sj <- null; /* Sj : subset of working paths, which is null at the beginning; */ 
While® (mass of Sj< T) Do 

/ <- Randomly generate an integer with a range |] ; 

Add to Sj ; I* put working path I into Sj ; */ 

WD' , <- ; 

ungrouped ^ ^ ungrouped ’ 

/* move the last element in to position /; */ 

^ 5 the number of paths in 

End While® 

End WhiW*^ 

Return 5,7; 

Output: S: an array of subsets of working paths that will be optimized together; 
j: number of subsets. 

7.5.3 Optimization and Approximating Optimization 

We have introduced in Section 7.3.2 several methods for appro ximating 
optimal path-oriented spare capacity allocation. For verifying the SLSP-R 
scheme, we present a method using InP formulation as a basis for op timizin g 
each subset of working paths. The spare capacity is allocated into the 
network one group of working paths after the other. For comparison purpose, 
the local search (LS) and successive survivable routing (SSR) are also 
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adopted. The LS method is examined with different initial states and 
different neighborhood relationships among working paths. On the other 
hand, SSR is examined with different sequences of working paths with P 
domains to be allocated. In general, LS is similar to SSR in the sense that the 
searching process is greedy without a global consideration. 

With the working paths being sub-grouped, schemes for achieving 
optimal or approximating optimal spare capacity allocation are performed on 
each of the subsets one after the other. During the searching process, each 
pattern of arrangement for each P domain is iteratively inspected. Instead of 
using Eq. (7.16) to calculate the minimum amount of spare capacity required 
along a link, the matrix approach introduced in Section 7.3.2 can be adopted. 
In fact, the searching process described in Eq. (7.15) has the same function 
as the matrix approach in finding the minimum spare capacity along a link to 
achieve 100% restorability. The derived matrices (i.e., the working path-link 
incidence matrix A, the backup path-link incidence matrix B, and the spare 
provision matrix Q should be recorded and delivered to the optimization 
process of the next subset of working lightpaths. The following paragraphs 
introduce the four approaches in detail before a numerical study is presented. 

7.5.3.1 Integer Programming (InP) Formulation 

Optimization for spare capacity allocation based on the InP formulation 
listed from Eq. (7.15) to Eq. (7.18) is performed iteratively for each subset. 
The entire initial states for all r, cy and i are set to 0, which means that 

no cycle is selected at the beginning. To optimize the jth subset of working 
paths is equivalent to solving the InP formulation on a subset of WD, in 

which a subset of x.’‘^ , { } is programmed while leaving 

{ } fixed. After optimizing the jth subset, { } and 

{ } will be fixed and available for the subsequent optimization 

of subsets. With this devise, computation complexity can be largely reduced 
as discussed in Section 7.5.1. The manipulation of the size of each subset of 
working paths can initiate a compromise between the computation time and 
optimization quality (or the extent of resource sharing between protection 
paths), which can serve the dynamic model presented in Section 7.4.4. 



7.S.3.2 Local Search 

The target cost function is derived by combining Eq. (7.14) and Eq. 
(7.16), which yields: 
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max 



l'S,r 

reD„ cyeCK,., 




(7.21) 



The initial state is randomly selected from the search space. Each candidate 
cycle is indexed by (w, i, cy), where 1 < w < , and 

l<cy <|cy,j|. Let the neighbor states of state (a,h)are all (a’, b’) such that 

a-l<a’<a + l, and 1 < b’< , where a, b are integers, and 

i 

a\ b’do not hold at the same time. In other words, generally, a 
searching process is completed after finding a local minimum by inspecting 
all the combinations of P domain allocation on the two working lightpaths 
numbered next to the current working lightpath. The initial state for a 

searching process is generated by assigning “0” to alljr^’*^, and randomly 
assigning “1” to of each combination (w, i). The reason we need only 
^|CF,.,| moves to completely inspect a working lightpath instead of 

i 

is that the superiority of a candidate cycle for a P domain of a 

i 

working path will not be changed as we try different candidate cycles in the 
other P domains of the same working path. This is true because little 
correlation exists between the cases in selecting different cycles in different 
P domains along a working path. 



The parameter ^^,‘^in Eq. (7.21) determines whether or not link I is 
traversed by the cy-th candidate cycle in P domain i of working path r. The 



purpose of the term 



reD* cyeCY,^ 



in Eq. (7.21) is to 



perform a function equivalent to the matrix approach, which derives the 
minimum amount of spare capacity along a link to achieve 100% 
restorability. For this purpose, every node maintains the spare provision 
vectors for its correspondence links as described in Section 7.4.2. 



It is clear that not only the sequence of working paths in a subset, but also 
the sequence of subsets to be processed, may influence the final searching 
result. With the LS method, the best computation efficiency can be achieved 
by iteratively finding a local optimal in each subset of working paths with a 
random initial state. On the other hand, if we intend to trade the computation 
efficiency with a larger searching area for deriving possibly a better 
searching result, the algorithm should be multi-trialed to examine different 
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sequences of working paths in a subset and different sequences of subset to 
be processed. Since the LS approach with multiple trials intrinsically 
possesses the capability to adapt the computation time with the quality of 
searching results, the interleaving of working paths is no longer necessary. 

7.5.33 Successive Survivable Routing 

With SSR, each P domain of a working path is allocated with a protection 
path segment one after the other. Among all the cycles in ,• that have 

been prepared for each working path r and P domain i, the algorithm selects 
a cycle for each P domain along the working path according to pre-defined 
cost function and link-state metrics. We define cost function for the selection 
of cycles along a working path as: Cost-’^ = Extra spare links required, 

which can be determined by the matrix approach demonstrated in Section 
7.3.1. Note that we have assumed that the selection of protection path 
segments for different P domains is independent; as a result, the optimality 
in the cycle allocation process for a P domain can be achieved solely by 
inspecting all the candidate cycles for the P domain. By the linearly additive 
property shown in Eq. (7.8), the allocation of working and protection path 
segments (e.g., the bolded and dashed lines in Fig. 7.16) does not need to re- 
calculate the whole spare provision matrix. Instead, the contribution from the 
P domain i of the path pair r to the spare provision matrix C*j ^ is needed. 

As shown in Fig. 7.16, the allocation of P domain i (where i^d^) needs to 
consider the working path segment ( P5L^ ,• -X-Z- PSL,.^^^ ) and the 
protection path segment (PSL,.j -A-B-C-D-E) in the working and 
backup path-link incidence vectors (i.e., A^, and , ^ ). We can then 
derive the contribution of spare capacity from the protection path segment in 
P domain i to working path r : ~ ^r,i,cy ‘ A-j,cy • Comparing the 

original demand for spare capacity with the amount of spare capacity 
specified in the updated spare provision matrix, we will be able to derive 
cycle cy in P domain i on working path r, for all (r, i, cy), which yields the 
least amount of additional spare capacity. 
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P domain i 



PSLi X z PSL(i+I) PMU 



Fig. 7.16. An illustration of a working path segment defined in the matrix 
approach. 

SSR can also use the statistical multi-trial approach to improve capacity 
efficiency at the expense of conq)utation complexity. Therefore, the 
interleaving of working paths is not necessary in this case. 

7.5.4 Numerical Results 

Experiment results in Table 7.3 are derived to evaluate the performance 
of using different approaches of spare capacity allocation. The 22-node, 30- 
node, 79-node and 100-node networks adopted are shown in Fig. 4.16(a), 
Fig. 4.16(b), Fig. 7.17(a) and Fig. 7.17(b). The computation time for all the 
cases is up to a limit, which is 1, 2, 5, 10 seconds in each of the 4 cases, 
respectively. We assume an open capacity along each network link (i.e., 
there is no limitation on the link capacity). LS-N and SSR-N are 
abbreviations for Local Search and Successive Survivable Routing with N 
multi-trials. InP-M-N depicts the Integer Programming formulation with 
each subset of working paths containing N working paths, where a total 
number of M trials are tested. The last row of the table records the results of 
using the InP formulation without the interleaving of working paths, which 
gives the optimal results for the spare capacity allocation. The time 
consumed for each of the 4 networks is 1000, 1340, 3000, and 9000 seconds, 
respectively. 

A fixed number of working paths are first randomly allocated into each 
network as shown in the second row of Table 7.3, with PSLs and PMLs 
being well defined for each working path. The distance in hop count between 
PSL^i-PML^j cannot be longer than 3, 3, 4, 5 hops for each of the 4 

networks. In general, the upper limit on the length of a candidate cycle is 10 
hops, which can be derived by using the cycle-enumerating algorithm [9]. 

It is clear that using InP formulation with working paths interleaved in 
the most diverse (MD) maimer requires the smallest number of spare links 
when the time factor is held constant among all the cases examined. It is 
notable that the InP with working paths being grouped in MO manner is 
outperformed by all the other InP schemes, and some of the multi-trial 
schemes as well. We observe that although the InP formulation can yield the 




194 



best resource sharing among the working paths in a subset, it can hardly take 
advantage of this characteristic along with the MO scheme due to the SRLG 
constraint on resource sharing between working paths in a subset. On the 
other hand, in the MD case, working paths are grouped with the maximum 
diversity, in which more resource sharing can be explored by the InP 
formulation to yield a better result. 

7.5.5 Design for the Size of Each Subset (Mw) 

Since every network event (i.e., a connection setup or tear-down) 
changes the capacity distribution, the spare capacity re-allocation process 
will have to restart to follow a new link-state if the current re-allocation 
process cannot be completed before the next network event arrives. In this 
case, the rate (or frequency) of successfully performing a re-allocation 
process is also an index of network performance, which depends on E, the 
arrival rate of network events, and Tredio, the computation latency for inter- 
arrival planning (or Mw, which is directly related to computation time). This 
section verifies that there exists an Mwgpt (i.e., the size of each subset which 
can achieve the best performance) if the network environment is such that 
connection requests arrive one after the other without any prior knowledge 
of future arrivals. We will present an approach for designing the size of each 
subset when the InP formulation with interleaved working paths is adopted. 
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100% 
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100% 
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100% 


1230 


InP2-15 (MO) 


100% 
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100% 
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100% 
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100% 


1232 


InPlO-5 (MD) 


100% 


117 


100% 


209 


100% 


637 


100% 
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InP4-10 (MD) 


100% 
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100% 
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100% 


635 


100% 
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InP2-15 (MD) 


100% 


115 


100% 


206 


100% 
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InPlO-5 (RD) 


100% 
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100% 


213 


100% 


660 


100% 


1225 


InP4-10 (RD) 


100% 


122 


100% 


214 


100% 


657 


100% 


1218 


InP2-15 (RD) 


100% 


119 1 


100% ! 


217 


100% 
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InP 


N/A 


111 


N/A 


201 


N/A 


630 


N/A 





Table 7.3. Total spare capacity required for each network using different 
allocation strategies. 
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A successful inter-arrival planning process needs to be finished before 
the next network event arrives, which is assumed to be the only approach to 
re-configuring spare capacity in the network. Note that the inter-arrival time 
Tia of adjacent network events is also a Poisson random variable with a 
mean: 



m,j) 



where n^ J is the number of lightpaths existing between node i and j, and //. j 
and Ajj are defined in Eq. (7.14). The above expression also quantifies the 

network dynamics. We can derive the probability of a successful re- 
allocation as: 



pir ;>7’ p-freallalTB 

where Te is the average time duration between two events, and Treaiio is the 
desired time duration for the optimization of a subset. 

Therefore, the expectation of the number of network events for a 
successful inter-arrival planning process, NS, can be derived as: 

NS = , and therefore: 



"^redlo ~ '^E 



(7.22) 



On the other hand, in Eq. (7.20) we also modeled the total time needed for a 
successful spare capacity allocation, T„a«o, as: 



_ g-|WDl 

reaUo ^ 1 ’ 



If we require that the re-allocation process be successfully performed on 
average every NS network events, the following relationship can be given by 
combining Eq. (7.20) and Eq. (7.22), which yields an expression between NS 
andMw: 
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NS = exp 



a-|WD| 

Mw-T^ 






(7.23) 



It is clear that a small Mw may increase the successful rate of inter-arrival 
planning processes, where the performance is improved since Treaiio is 
decreased. On the other hand, a small Mw may also impair the performance 
due to a limited extent of resource sharing achieved in the reallocation 
process. Therefore, the performance in terms of throughput should be a 
function G{Mw) derived by aggregating the two functions, Gj{NS) and 
GafMw), i.e.. 



G(Mw) = Gi iNS)0G^ (Mw) = g' (Mw)0G2 (Mw) (7.24) 

where “0 ” is an operator that aggregates two functions on both sides. 

Assume Mw > 1 , we get > 0 , 

dNS 

dGANS) dGAMw) dMw ^ 
dNS BMw dNS 



^^3G,Ww)<o 

dMw 



(7.25) 



Since it is more than obvious that the larger the subset of working paths the 
better the quality of optimization, the following relationship exists: 



dG^(Mw) ^ Q 
dMw 



(7.26) 



From Eq. (7.24), (7.25) and (7.26), we can expect an Mwopt which yields 
the best performance at a specific network load, computing power, and 
arrival rate of network events, which will be further verified through 
simulation in the following section. 

7.5.6 Performance Evaluation 

The following experiments were conducted to verify that the use of 
SLSP-R can improve network performance in terms of blocking probability 
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of building up working and protection path-pairs. In this section, we will 
first introduce an on-line algorithm adopted in this simulation for 
dynamically allocating P domains for each connection arrival (The same 
topic will also be covered in Chapter 8, where a thorough discussion on 
dynamic survivable routing is provided). Then the assumptions of this 
simulation will be stated and the results will be presented. 

We examine the SLSP-R scheme in 22-node, 30-node, 79-node and 100- 
node networks as shown in Fig. 4.16(a), Fig. 4.16(b), Fig. 7.17(a) and Fig. 
7.17(b), respectively. The networks are assumed to have 2 fibers in each 
direction between adjacent nodes, which are contained in a conduit. Each 
fiber contains 8 wavelength channels with the same bandwidth. Each 
connection request is an establishment of a lightpath between two nodes 
equipped with an end-to-end shared protection service, which arrives 
according to a Poisson process and departs after a period defined by an 
exponential distribution function. All the lightpaths in a conduit are in the 
same SRLG (i.e., they are subject to the same risk of single failure). For 
simplicity, full wavelength conversion is allowed in the simulation. All 
through this simulation, we take the InP formulation with the most diverse 
(MD) working path interleaving as the approach of performing spare 
capacity re-allocation. 

7.5.6.1 Allocation of Protection Domains 

The simulation first needs an on-line P domain allocation algorithm to 
route the current connection request into the network. In this simulation, we 
use C-BAR (see Chapter 4) to plan alternate paths. As a connection request 
arrives, the destination node coordinates the lightpath allocation process by 
deriving the alternate paths from its routing table, probing the availability of 
wavelength channels along each alternate path, and assigning PSLs and 
PMLs to a specific set of nodes along the available alternate paths according 
to the restoration speed requirement. The nodes that behave as PSLs invoke 
the survivable routing algorithm to allocate the protection paths and form the 
corresponding P domains. 

The network shown in Fig. 7.16 provides a useful example of P domain 
allocation. As a connection request arrives, the destination node assigns 
nodes E and I as PSLs, and nodes F and J as PMLs, to meet the restoration 
speed requirement. Each of the PSL nodes A, E and I, derives the residual 
network link-state by excluding the whole working path (except its PML 
node) as well as the protection resources that violate the SRLG constraint 
fi"om the network. Then Dijkstra’s shortest path first algorithm is performed 
by each PSL to derive the protection path segment for its P domain. The 
source node coordinates the whole distributed computing process, and 
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selects the best allocation of P domains among all wavelength planes along 
all the alternate paths. 




(b) 



Fig. 7.17. (a) 79-node and (b) 100-node networks. 

A lightpath connection request is considered blocked if the working and 
protection paths cannot be established at the same time. After a coimection is 
terminated, all the network resources reserved by the connection are 
released. In this simulation, each trail has 100,000 connection requests under 
a load condition of each S-D pair. Data is derived by averaging the results of 
4 trials. Poisson arrival and exponential distributed holding time are 
assumed. For each traffic load level of experiment, every node-pair has a 
random arrival and departure rate. The load of the kth S-D pair is defined as 
the ratio of \ over , where is the arrival rate and is the departure 
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rate. WjAout loss of generality, the value of is uniformly distributed with 
means A , and = 1 for all k. Thus the average load p of the network is 
A. 

Simulation Results 

Fig. 7.18 shows the computation time versus the size in each subset. 
According to the administrative operation requirement and network traffic, 
the maximum allowable number of working paths in a subset for 
optimization can be derived. Fig. 7.19 demonstrates the performance versus 
the mileage in each subset. With more mileage optimized in a subset, better 
performance is yielded in both light and heavy network traffic. The 
experiments were conducted in a heavy load condition (i.e., jo =0.4) and in 
a light load condition (i.e.,jo =0.2). Even if the network is heavily loaded, 
the arrival rate of network events (symbolized as E here) can be small since 
the traffic load of the kth node pair is defined as the ratio of A^ and , and 
has nothing to do with the absolute values of and . Fig. 7.18 and Fig. 
7.19 verify Eq. (7.22) and Eq. (7.26). We have only conducted the 
experiment using the 22-node network since the results are intuitive. 




Fig. 7.18. Computation time versus number of working paths in a subset 
for optimization. The total computation time is 0.24 seconds as Mw = 1 in a 
subset using ULTRA 10, which has been degraded as SSR. The 30-node 
network is adopted. 
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Normallzad Blocking Rate vs. No. Working Paths 
In each Subset (22-node) 




Fig. 7.19. Normalized blocking rate versus the number of working paths 
in each subset of working paths that are optimized together. The mileage is 
normalized based on the total mileage of the working paths in the network. 

The relationship between the period of performing SLSP-R versus the 
call blocking rate with light and heavy network traffic is shown in Fig. 7.20. 
The blocking rate is normalized to the situation in which SLSP-R is 
performed in every interval of two consecutive network events. We can 
easily observe from the figures that a network with heavy traffic needs more 
frequent invocation of InP formulation sub-routine than is the case with light 
traffic if the same performance is to be maintained. Another important 
observation is that the number of connections residing in the network is 
larger when the traffic is heavy. As a result, more tear-down requests in a 
row may leave larger room to improve than the situation where the network 
traffic is light. Fig. 7.20 is also a verification of Eq. (7.25). 
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l4iTt3ergf Bants Between TtoSudo^^ OstirrizEtion 



(a) 



Riequencyaf Iriuddng SLSP-R Rscon^guration 
h/bchsrismwtth Heavy Tretffic 




(b) 



Fig. 7.20. Frequency with which the SLSP-R reconfiguration sub-routine 
is 1 voked versus the normalized call blocking rate with (a) light network 
tral c and (b) heavy network traffic. 

The computation time for completing a reallocation process, Treaiio, 
depends on only the number of working paths existing in the network and 
the computing power of the system, which needs synchronization with the 
dynamicity of the network before the two quantities (i.e., Tredio and Te ) can 
be comparable. In order to synchronize the time scale between network 
events and the computation time for the InP formulation process, we assume 
that p is chosen such that the successful rate of completing an optimization 

during two events is between 0.5 and 0.01 (i.e., the number of events 
between two successful optimizations is between 2 and 100). The 
synchronization process derives the parameter a in Eq. (7.18) such that the 
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'^reaiio “ 2 • Tg when 30 working paths are grouped in a single subset (i.e.. 



T = 

reallo 



Mw 






Mw=30 



= 2-T,). 



The value of Te can be easily derived with the data provided in Table 7.4. 
Note that the total number of working paths existing in the network is 
determined only by the network topology and the traffic load in the steady 
state by a rough estimation fi:om the historical record, and is listed in Table 
7.4 for the four networks with different traffic load. The program will invoke 
the SLSP-R algorithm after an event is filled, and will apply the optimization 
result onto the network link state database only if Tredio is less than the time 
dmation between this and the next event. 





No. of 
Link 


Avg. Nodal 
Degree 


Avg. 

Distance 


Light 

Load 


Light 

Load 

JmL 


Average 

d-Y 


22-node 


44 


4.00 


2.49 


' 190 ■ 


w^m 


10 


30-node 


63 


4.20 


2.71 


410 


450 


12 


79-node 


108 


2.73 


6.57 


1100 


1200 


15 


100-node 


179 


3.58 


9.50 


1600 


1750 


22 



Table 7.4. Topology information and the approximate number of working 
paths under different load conditions of steady state for the four networks. 

A comparison of performance in terms of blocking probability for 
different number of working paths in a subset (i.e., Mw) under both a light 
load and heavy load is shown in Fig. 7.21(a) and Fig. 7.21(b), respectively. 
The best performance can be achieved in each network with a specific size 
of a subset, Mwopt- The smaller the network size is, the smaller value of 
Mwopt can be derived. Performance decreases when the size of a subset is 
getting larger due to the fact that there is little chance for the optimization 
process to be completed before the arrival of the next event. As a heavy 
network load has a smaller value of MWopt than does a light load, the 
reconfiguration process may bring more improvements to networks 
accommodated with more working paths. Therefore, the contribution from 
each invocation of SLSP-R on heavy-loaded networks is greater than with 
light network traffic. 

The framework of performance optimization with the SLSP-R re- 
allocation algorithm can be generalized to any network environment with 
different topology (i.e., the parameter a), traffic levels (average inter-arrival 
time of network events, Tg), and computing power (i.e., the relationship 
between Tredio and Mw, and parameter a). 
































203 



No. of Working Paths in a Subset vs. Normalized 
Blocking Rate (Light Load) 




(a) 




Fig. 7.21. Mileage of working paths in a subset versus blocking rate with 
(a) light load and (b) heavy load. 

7.6 Summary 

This chapter discussed a number of topics regarding spare capacity 
allocation in optical networks. The topics included the span-oriented and 
path-oriented allocation schemes. The path-oriented schemes need to 
investigate the relationship between working paths in the network while the 
span-oriented schemes do not. We introduced Node Cover, Ring Cover, 
Span Protection, Protection Cycle and Pre-configured Cycles (p-cycle) as 
Span-oriented schemes, and an Integer Progra mming model - as well as a 
series of approximating optimal approaches - for the implementation of the 
path-oriented spare capacity allocation schemes. 

This chapter put much emphasis on the path-oriented scheme due to the 
fact that it can serve in an environment of WDM networks without full 
wavelength conversion capability. Other approaches included the Successive 
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Survivable Routing (SSR) and SLSP with spare capacity Re-allocation 
(SLSP-R). A systematic and efficient approach for correlating different 
working capacity was introduced, in which the relationship between 
working, protection, and spare capacity required for full restorability are 
encoded into a matrix form. With the matrix method a distributed 
management with the linear and additive characteristics of working and 
protection paths is possible, which can largely reduce simulation efforts and 
provide a systematic and error-free control environment. Since the 
interleaving of working paths into several subsets can compromise 
confutation complexity with capacity efficiency, three types of working 
path interleaving schemes were developed, including the most overlapped 
(MO), most diverse (MD), and randomly distributed (RD). Then we 
demonstrated an experiment in which different optimization and 
approximating optimal approaches were used to perform the spare capacity 
allocation. In addition to the multi-trial local search, SSR, and InP with MO, 
MD and RD (three types of working path interleaving), we listed the 
experiment results for all the above approximating schemes as well as the 
optimal one. We concluded that the InP formulation with the most diverse 
(MD) type of working path interleaving yields the best performance 
compared with all the other approximating optimal schemes. 

The spare capacity re-allocation scheme SLSP-R, which re-allocates 
spare capacity as an inter-arrival plaiming, is designed for the network 
envirorunent with dynamic traffic. Since the optimization process in an inter- 
arrival planning may fail due to an insufficient computation time (i.e., the 
next network event arrives before the current optimization process finishes), 
the re-allocation process is interrupted - which obviously impairs 
performance. SLSP-R improves network performance by manipulating the 
size of each subset of working paths, where both the traffic dynamicity and 
computation efficiency are taken into consideration. We verified through 
simulation that the probability of successfully completing a reallocation 
process directly influences performance. The increase of the size of the 
subsets can improve the optimization quality, while reduces the probability 
of performing a successful optimization. On the other hand, using small- 
sized subsets can improve the extent of resource sharing while reducing the 
probability of performing a successful optimization. The framework of 
SLSP-R provides an approach to improving network performance by 
initiating a compromise between the above two counter effects. 

A comparison between all the approaches mentioned in this chapter in 
terms of scalability, dynamicity, class of service, capacity efficiency, and 
restoration speed is given and listed in Table 7.5. 
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Scalability 


Dynamicity 


Class of 
Service 






Node cover 


Low 


Low 


Low 


Low 


Low 




Low 


Low 


Low 






BylPB 


Low 


Low 


N/A* 


Low 


High* 


SSR 


Med 




Low 


Med 


Med 


Span 

Protection 


Med 


Low 


Low 


Med 


Med 


SLSP-R 








Med 


Med 



Table 7.5. Comparison of Protection and Restoration Strategies 
*The scheme “Protection Cycles” is a pure link-based protection, in which 
every link has a dedicated protection cycle. 

**p-cycle has the highest restoration speed because of its pre-configured 
spare resources instead of only being pre-planned. 

***SLSP-R divides a working path into several protection domains, and 
supports the dynamic allocation of working-protection path-pairs, which can 
achieve scalability, dynamicity and class of service. 

References 

[1] O. Crochat, J. L. Boudec, and O. Gerstel, “Protection Interoperability for WDM Optical 
Networks,” lEEE/ACM Transactions on Networkings Vol. 8, No. 3, pp. 384-395, June 
2000. 

[2] D. Stamatelakis, W. D. Grover, “Network Restorability Design Using Pre-configured 
Trees, Cycles, and Mixtures of Pattern Types,” TRLabs Technical Report TR-1999'05, 
Issue 1.0, Oct. 2000. 

[3] W. D. Grover and D. Stamatelakis, “Cycle-Oriented Distributed Preconfiguration: Ring- 
like Speed with Mesh-like Capacity for Self-planning Network Restoration,” Proceedings 
IEEE International Conference on Communications y June 1998, vol. 1, pp. 537-543. 

[4] B. M. E. Moret and H. D. Shapiro, Algorithms from P to NP Volume I Design 
Efficiency, the Benjamin/Cummings Publishing Company, Inc, 1990. 

[5] M. Herzberg and S. J. Bye, “An Optimal Spare-Capacity Assignment Model for 
Survivable Networks with Hop Limits,” Proceedings IEEE, Globecom 1994. 

[6] G. Ellinas and T. E. Stem, “Automatic Protection Switching for Link Failures in Optical 
Networks with Bi-directional Links,” Proceedings IEEE GLOBECOM'96y November 
1996, vol. 1, pp. 152-156. 

[7] G. Ellinas, A. G. Hailemariam, and T. E. Stem, ‘Trotection Cycles in Mesh WDM 
Networks,” IEEE Journal on Selected Areas in Communications, vol. 18, no. 10, pp. 1924- 
1937, Oct. 2000. 

[8] D. Guo, J. Zhan, W. Chu, H. Zhang, D. Papadimitriou, S. Ansorge, N. Ghani, J. Fu, Z. 
Zhang, D. Pendarakis, and S. Dharanikota, “Optical Automatic Protection Switching 
Protocol (0-APS)“,Intemet Draft, work in progress, < draft-guo-optical-aps-01.txt>, Nov. 
2001. 




































206 



[9] P. Mated and N. Deo, “On algorithms for enumerating all circuits of a graph,” SIAM J. 
Computation, vol. 5, no. 1, Mar. 1976, pp. 90-99. 

[10] D. Bertsimas and J. N. Tsitsiklis, Introduction to Linear Optimization, Athena Scientific, 
1997. 

[11] W. D. Grover, J.B. Slevinsky, M.H. MacGregor, “Optimized Design of Span-oriented 
Survivable Networks,” Canadian Journal of Electrical and Computer Engineering, vol. 
20, no.3, August 1995, pp. 138-149. 

[12] R. R. Iraschko, M. H. MacGregor, and W. D. Grover, “Optimal Capacity Placement for 
Path Restoration in STM or ATM Mesh-Survivable Networks,” lEEE/ACM Transactions 
on Networking, Vol. 6, pp. 325-336, June 1998. 

[13] Y. Liu, D. Tipper, and P. Siripongwutikom, “Approximating Optimal Spare Capacity 
Allocation by Successive Survivable Routing,” in Proceedings of IEEE Infocom'Ol, vol. 2, 
pp. 699-708, April 2001. 

[14] Y. liu and D. Tipper, “Spare Capacity Allocation for Non-Linear Link Cost and Failure- 
Dependent Path Restoration”, in Proceedings of the third International Workshop on 
Design of Reliable Communication Networks (DRCN 2001), Budapest, Oct. 2001. 

Questions: 

1. What is failure propagation? Give an example of a protection mechanism 
getting more complex because of the failure propagation phenomenon. 
Why does span-based protection not suffer from this phenomenon? 

2. Explain the main differences between span-oriented and path-oriented 
spare capacity allocation schemes. Why are the path-oriented schemes 
more suitable in WDM networks? 

3. In the span-oriented spare capacity allocation, a cycle put into networks 
should be directional for protecting a specific group of lightpath 
segments along a directional link. Do you agree that we have to either 
consider bi-directional cycles or to manipulate each cycle with 
orientation? Explain your answer with a computation complexity 
analysis. 

4. With span-oriented spare capacity allocation, different wavelength planes 
need independent cycles (or cycles with different colors) in case no 
wavelength conversion is presented. With the consideration of 
wavelength continuity constraint, give a brief analysis of the increase of 
computation complexity compared with the situation in pure IP networks. 

5. If the network has only a single fiber along each directional link, each 
wavelength plane has only two wavelength channels in different 
directions along each span. Under this assumption, a bi-directional cycle 
may exhaust all the network resources along the physical routes on a 
specific wavelength plane. Explain this phenomenon. 

6. Considering all the cases described in questions 3, 4, and 5, estimate the 
increase in computation complexity compared with the situation where 
conventional IP networks are adopted. 
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7. Describe the working principles in using Node Cover, Ring Cover, Span 
Protection, and p-cycle. Discuss the pros and cons of each of these 
schemes. 

8. Briefly describe Successive Survivable Routing (SSR). We can use the 
statistical multi-trial approach to improve the performance of SSR by 
randomizing the sequence of working paths to be allocated. However, it 
may happen that two or more trials have the same sequence. Calculate 
the probability of having no replicated sequence in an experiment with N 
trials, where the number of working paths is D. What is the expectation 
in the number of redundant trials? 

9. Provide at least three different approaches to define neighborhood for 
Local Search in spare capacity allocation. 

10. Briefly describe simulation aimealing. What is the control temperature 
used for? Why do we need to change the control temperature in different 
iterations? 

11. In the matrix approach, we derive the maximum of each row of the spare 
provision matrix to be the minimum spare capacity required. Why is it 
the maximum instead of sum of the elements in a row? 

12. Why can we define an inter-arrival planning in an optical network? Can 
we have the same definition in the IP networks? Discuss the differences 
between control and traffic dynamics in the optical and IP networks. 

13. List the possible operations or mechanisms that may be included in the 
inter-arrival planning process. Briefly explain the purpose for each of 
them. 

14. In Fig. 7.10(a), what is the spare capacity vector on link A-B? 

15. In the SRLG coherence, each node maintains a spare provision vector for 
each of its correspondence links. Give the formula for solving the spare 
provision vector given the working paths traversing the link and their 
protection paths in the network. 

16. How to solve minimum requirement of the spare capacity when we 
allocate a working path into the network with the knowledge of all spare 
provision vectors for the links that are traversed by the working path? 

17. What is the main purpose in interleaving the optimization process for 
spare capacity re-allocation? What is the strategy adopted for this 
purpose in this chapter? 

18. If we increase the number of working paths (or P domains) in a subset, 
what network indexes are going to be increased? What indexes are going 
to be decreased? 

19. Why does the grouping policy “most diverse” (MD) achieve the greatest 
extent of resource sharing? 

20. Why do we need to synchronize Treaiio and Tg? Explain the physical 
meanings. 
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21. As we saw in the simulation, we performed a reallocation process if the 
inter-arrival time 7 m is greater than Tredio- 

(a) Describe the approach in this chapter by which the traffic dynamics is 
quantified. 

(b) Describe the reasons why the best performance can be achieved by 
manipulating the number of working paths in each subset (Hint: 
describe the two observations where the performance can be 
improved and worsened when the size of each subset is increased). 

(c) If the traffic pattern does not follow Poisson distribution (e.g.. Self- 
similar or M/G/1), what do you expect the intact on the framework? 

(d) Do you think there will be any difference if we use the following 

strategy for determining whether a reallocation process is conducted: 
for every completion of a network event, we flip a coin with a 
probability of {Tg > Treaiio } = that it will be a “head”, and 1 

- that it will be a “tail”. Whenever a head is present, a 

reallocation is performed. Explain your answer. 

(e) Use the information provided in Table 7.4 with the traffic load 
p = 0.2 and p = 0.4 , plot F'^m ^ ’^redio ) versus Mw. 

22. A network topology is given in Fig. Q.22. The traffic distribution is 
shown in Table Q.22. Assume the working lighpaths between an S-D 
pair follow the same physical route, and the spare lightpaths for the 
working lightpaths of an S-D pair also go through the physical route. 
Assume that the allocation of a lightpath along a link takes one unit of 
spare capacity. 

(a) Write down the working and protection path-link incidence matrix, 
and the spare provision matrix for the network. 

(b) Find the minimum spare capacity (i.e., number of bidirectional spare 
lightpaths) required along each span to achieve 100% restorability. 

(c) A new connection request arrives across node A and F for 3 
bidirectional lightpaths. The physical route for the protection 
lightpaths is A-D-G-F. Find the new working and protection path-link 
incidence matrix, and the spare provision matrix for the network. 

(d) Can we improve the capacity efficiency by allocating the spare 
capacity for the newly arrived connections through A-B-E-F? How 
much saving of extra spare capacity do we get? 

(e) For the new connection request, if we are allowed to allocate the 
protection lightpaths between an S-D through different physical 
routes (e.g., putting one lightpath through A-D-G-F, and two through 
A-B-E-F), what is the best capacity efficiency that we can achieve? 

(f) We attempt to reconfigure the spare capacity in the network apparatus 
shown in Fig. Q22. Assume the protection lightpaths of an S-D pair 
are not necessarily routed along the same physical route. Find 3 
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possible arrangements of physical routes of protection lightpaths for 
the working capacity between each S-D pair. 

(g) Formulate the reconfiguration problem into Integer Programming 
(InP) problem. Use the symbol given in this chapter. 




Fig. Q.22. 



A-B 

B-E 

A-C 

B-C 

E-F 

C-F 

A-D 

C-D 

F-G 

D-G 



1 

2 

3 

4 

5 

6 

7 

8 

9 

10 



Working Flows 


Number of bidirectional 
lightpaths required 


Protection Routes 


a (A-B-E, or 1-2) 


2 


A-C-F-E (or 3-6-5) 


b(A-D-G,or7-10) 


3 


A-C-F-G (or 3-6-9) 


c (A-D, or 7) 


1 


A-C-D (or 3-8) 


d(B-C-D-G, or 4-8-10) 


1 


B-E-F-G (or 2-5-9) 


e(B-E-F-G, or 2-5-9) 


3 


B-E-F (or 2-5) 


f(E-F-G,or5-9) 


3 


E-B-C-D-G (or 2-4-8-10) 


g(A-C-F,or3-6) 


2 


A-D-G-F (or 7-10-9) 


h(C-F, or link 6) 


2 


C-D-G-F (or 8-10-9) 


i(A-B, or link 1) 


3 


A-C-B (or 3-4) 



Table Q.22. 



23. Three working lightpaths with the same bandwidth are shown in Fig. 
Q.23(a): W1 (S-E-F-G), W2 (A-B-F-G) and W3 (K-H-I-J-D). Then- 
protection paths are PI (S-A-B-C-D-G), P2 (A-K-H-I-J-D-G), and P3 
(K-A-B-C-D). 

(a) Find spare provision matrix for the network. 

(b) Find the spare capacity vectors for link F-G, A-B and B-F. 

(c) Use the result of (b), find the spare capacity of each span after another 
working lightpath going through J-D-G-F is allocated. 

(d) If we have the spare capacity originally deployed as shown in Fig. 
Q.23(b), where W3 is segmented into two protection domains - one 
for path segment A-B-F and another for B-F-G. The protection path 
segments for the two protection domains is A-S-E-F and B-C-D-G, 
respectively. Repeat (a) to (c). 
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Fig. Q.23. 




Chapter 8 



SURVIVABLE ROUTING WITH DYNAMIC 
TRAFFIC 



The spare capacity allocation schemes introduced in Chapter 7 either take 
a specific group of working paths or all working paths into consideration at a 
time during the optimization process. However, as any connection request 
arrives, an on-line algorithm is still required to allocate it. The on-line 
algorithm should be able to guarantee derivation of a link- or node-disjoint 
or link-disjoint working and protection (W-P) path-pair for a connection 
request if any exists. Since the reconfiguration or reallocation of spare 
capacity in the networks may not be frequently performed, the on-line 
algorithm of path selection needs to be both computation- and capacity- 
efficient so that networks can accommodate as many subsequent coimection 
calls as possible without losing the ability to deal with dynamic traffic 
requests that arrive one after the other. 

This chapter presents a number of reported survivable routing schemes 
for both dedicated and shared protection. In Section 8.1, we first define 
survivable routing, while Section 8.2 introduces diverse routing algorithms 
for dedicated protection. Both the symmetrically weighted and 
asymmetrically weighted optimal or near optimal approaches are included. 
Section 8.3 introduces an approach to solving the diverse routing problem 
for shared protection (i.e., different protection paths can share network 
resources if the SRLG constraint is met). Section 8.4 presents a dynamic 
survivable routing scheme for shared protection. 
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8.1 Survivable Routing 



“Survivable Routing” is defined as the attempt to derive a W-P path-pair 
using any adaptive path selection algorithm with a well-designed cost 
function and dynamic link-state metrics. The survivable routing schemes can 
be used either for dedicated or shared protection. With the latter case, a 
protection path is derived based on the location of its corresponding working 
path, so that the dependency between the two paths can be addressed. This 
task can be accomplished by the Successive Survivable Routing (SSR) as 
described in Chapter 7. With the former, since no resource sharing is 
allowed, the allocation of path-pairs should be selected such that the W-P 
paths take up the smallest amount of network resources possible; (e.g., the 
total hop count of the two paths is minimal). In order to meet some specific 
design requirements, the optimization objective could possibly be the hop 
count of a working path weighted by an integer ri: , plus the hop count of the 
corresponding protection path. 

In SSR, as we have seen, each traffic flow routes its working path first, 
followed by its backup path. This sequential derivation of a W-P path-pair is 
where the “successive” notion originates. SSR is also called the Two-Step- 
Approach [2] since the derivation of working and protection paths comes in 
two completely separate steps. To maximize resource sharing, the SRLG 
constraint has to be considered and included in the path selection process. 
Although this method is straightforward and simple, it may fail to find any 
disjointed path-pair after erasing the first path, which isolates the source 
node from the destination node in the network, as shown in Fig. 8.1. Besides, 
in some cases the algorithm cannot find the optimal path-pair if another 
better disjoint path-pair exists in which the working path is not the shortest 
one. To avoid the above drawbacks, we will introduce an algorithm called 
the Iterative Two-Step-Approach (ITSA), which examines not only the 
shortest path as a working path, but also examines the loop-less k-shortest 
paths, where k = 1, 2, ...., K. To be more specific, the ITSA algorithm 
discards any working path once it isolates the network, and inspects the next 
shortest path as a worWng path. The ISTA algorithm will keep on inspecting 
the k-shortest paths as working paths until a feasible answer is derived. 

Design for dedicated protection is much simpler than is the case with 
shared protection, as we do not have to consider resource sharing. In this 
case, working and protection paths can be derived at the same stage with a 
uniform link-state. In order to achieve specific design requirements in some 
cases, we may need to put a weighting on the cost of a working path (i.e., a 
cost function a C+CP is adopted, where C and CP are costs of a working 
path and its corresponding protection path, respectively; and the parameter 
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a is a weighting on the working path). This is also called an asynunetrically 
weighted diverse routing problem. The physical meaning of adopting the 
weighting parameter a is to distinguish the importance of the segment 
between the lengths of working and protection paths. A special case of a = 1 
is a degradation of the use of the cost function a C+CP, and can be solved 
by Suurballe’s algorithm [2] within a polynomial computation time. In this 
chapter, in addition to an introduction to Suurballe’s algorithm, the solving 
of the asymmetrically weighted diverse routing problem will be addressed. 
The problem has been proved to be NP-complete in computation complexity 
[6], which is intolerable if the algorithm is to be used on-line. We will 
therefore examine several heuristic approaches that can greatly improve the 
computation efficiency. 




Fig. 8.1. An example demonstrating that the two-step-approach cannot 
guarantee the discovery of a node-disjoint path-pair even if a set exists. 

8.2 Solving the Asymmetrically Weighted Node-Disjoint 
Path-Pair Problem 

This section presents a series of approximation algorithms for solving the 
asymmetrically weighted disjoint path-pair problem. The input of the 
problem is a directed graph 0(0., E) representing a network topology where 
the set of nodes is denoted by ^2 , and the set of arcs is denoted by E with a 
positive cost for each link (denoted as of arc ee £). An arc from node v 
to u is denoted by e(v -4 u ) . The arc is also called an edge if it is undirected. 
The number of edges of the graph is denoted by |£| while the number of 
nodes is denoted by N. 
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The demand has a source node 5, and destination node D, and a factor (or 
weight) a for weighting on the cost of working paths. The task is to find two 
paths from a given source 5 to a destination D: the working path, and the 
protection path. The two paths should be node-disjoint and have the cost 
function f(S,D,a) = a C„+Cpto be minimized, where and Cp are the 

cost of the working and protection paths, respectively. The cost of a path is 
defined as the sum of the costs of each link belonging to the path. 

In a directed graph, a node-disjoint path-pair can be derived in 
polynomial time using the algorithm dealing with a link-disjoint path-pair, 
by splitting a network node into two twin nodes as shown in Fig. 8.2. One of 
the nodes is called an incoming node (i.e., Vi, V 2 , V 3 , and Va), and the other 
is called outgoing node (i.e., V 4 , V 5 , and Vb) [3]. Fig. 8.2 shows the 
transformation between the algorithms for deriving a link-disjoint path-pair 
and for deriving a node-disjoint path-pair in a directed graph. With the 
transformation, the computation conq>lexity of solving the link-disjoint 
diverse routing problem is the same as that of solving a node-disjoint 
problem. The rest of this chapter focuses on the node-disjoint case to show 
how both a node and link failure can be restored. As shown in Fig. 8.2, 
during the routing process the internal links (i.e., the links between two twin 
nodes) can only be used once at most, which yields the disjoint path-pair 
with the desired characteristic. 




Fig. 8.2. Transformation between node-disjoint and link-disjoint cases in 
the network topology. The cost of the dashed directed links is zero, which 
means that no cost is charged to go inside a node. 
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8.2.1 Iterative Two-Step- Approach 

The Two-Step-Approach algorithm can be enhanced by sequentially 
inspecting the k-shortest paths. This is called the Iterative Two-Step- 
Approach (irSA). The best solution can be derived based on a pre-defined 
cost function. The flowchart of the algorithm is shown in Fig. 8.3. At the 
start of the algorithms, the variables CS := °° and CPS := «> , which store the 
cost of the best W-P pair, as well as SPi„dex and PPi„dex> which keep the best 
W-P path-pair, are initialized. The ith shortest path SP, is taken as the 
working path to find the shortest path PPi in the same graph, with the edges 
and nodes of SP, being erased. The cost of the path-pair « C, +Ci^ will be 

compared with the cost of the recorded path-pair (i.e., a-CS+ CPS) to see if 
the current path-pair is a better one. If so, the cost and the index of the 
current path-pair are recorded as CS, CPS and Index. 

The algorithm uses the inequality (C, - C5) • a > CPS - C,- to determine the 

optimality of the recorded path-pair. Since Ct is increased monotonously as i 
is increased, and CS > CPS, it is clear that if the above inequality holds, with 
a larger i (i.e., using more costy working paths) will never be able to break 
the inequality. Therefore, the current recorded path-pair is the optimal. The 
algorithm can also be stopped by a pre-defined criterion, such as the 
computation time or the number of k-shortest paths that have been inspected, 
which is determined by the requirements of the system. 

The rrSA algorithm can find the asymmetrically weighted optimal node 
disjoint path-pair, which yields a computation complexity strongly 
determined by the efficiency of the k-shortest path ranking algorithm. Yen’s 
k-shortest path ranking algorithm [4] provides a complexity of 

0(N^ -logN) , which satisfies the requirements of dynamicity and 
scalability only to a limited extent. The other disadvantage of the heuristic 
algorithm is that there may be an exponential number of distinct k-shortest 
paths between node-pair (S,D). In other words, its complexity in the worst 
case gives an exponential upper bound in finding the optimal solution. 
Hence, more efficient and performance-guaranteed algorithms are needed. 
However, the ITSA algorithm can work extremely well in solving the 
diverse routing problem when shared protection is adopted. We will further 
investigate shared protection schemes in Section 8.3, where the ITSA 
algorithm will be a main focus. 
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Fig. 8.3. Flow chart for finding the disjoint W-P path-pairs. 

8.2.2 Suurballe’s Algorithm 

A brief overview on Suurballe’s algorithm is provided in this section. 
Suurballe’s algorithm can find node-disjoint path-pairs in the case of a = 1 
with a polynomial time complexity of 0(|iS|-log,,^£|/jyj ^) , where n is the 

number of nodes and m is the number of edges in the network. The algorithm 
is described as follows. 

Step 1: Find the shortest path SP from source S to destination D (e.g., 
S-»nl->n4->D) in a given graph G( Q. ,E) as shown in Fig. 8.4(a). 
Label each node by d(i), which is defined as the node potential determined 
by its distance from the source. The labeling of each node can be 
accomplished with Dijkstra’s algorithm. 

Step 2: Reverse all arcs along the shortest path, and multiply their cost with - 
1, as shown in Fig. 8.4(b). 

Step 3: Modify the cost of all arcs according to the following formula: 
+d(i) -d(j) to get a non-negative arc cost as shown in Fig. 
8.4(c). The modified graph is denoted by G' . 
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Step 4: Find the shortest path SP' in the modified graph G', which gives 
SP' as ( S ^ n3 n4 -> Til -4 n2 -4 D ) as shown in Fig. 8.4(d). 

Step 5: Map the shortest path of SP' to G and eliminate the common arcs of 
SP and SP' in G to obtain the two desired paths with minimum total cost, as 
shown in Fig. 8.4(d), which shows the result of the second shortest path 
search. 





in the circle next to the node. The cost of an arc is beside the arcs. 

8.2.3 Integer Linear Progranuning Formulation (ILP) 

The optimal node-disjoint path-pair can also be solved via an Integer 
Linear Programming formulation as follows: 

Objective: 



Minimize • x^ + y^)-Cg 

eeE 



Subject to the following three constraints: 







218 



1. The flow conservation constraint for all nodes / e V ; 



N N 



;=1 *=1 



1 if I = source node S 
■ - 1 if i = destination node D 
0 otherwise 



N N 



y=l *=1 



1 if i = source node S 
■ - 1 if i = destination node D 
0 otherwise 



( 8 . 1 ) 



2. The binary-flow constraints: 



X, e {0,l}. y, € {0,l} for all edges ee E 



( 8 . 2 ) 



3. The constraints of link-disjointness of the path-pairs: 



< 1 for all edges ee E (8.3) 

In the above expressions, the binary flow indicators, and y«, take value 1 
if either the working or protection path uses the edge e, or 0 otherwise, a is 
the weighting factor for the working path, and Cg is the cost of taking edge e. 
According to the constraint in Eq. (8.3), the working and protection paths 
will not use the same edge. To make the path-pair node-disjoint, the 

N 

following constraint should be held instead of Eq. (8.3): ^ (Xy + yy ) < 1 

M 

for all edge e & E and for all nodes i except the source and the destination. 



8.2.4 Linear Relaxation 

Solving the ILP formulation of the problem is NP-hard. A more 
computationally efficient approach has to be developed for dynamic 
networks. To solve a linear programming (LP) or flow problem is much less 
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complex than solving an Integer program. Thus, we adopt two more simplex 
methods: Single Flow Relaxation (SFR) and Linear Programming 
Relaxation (LPR). 

8.2.4.1 Single Flow Relaxation 

The purpose of Single Flow Relaxation is to solve Minimal Cost 
Network Flows (MCNF), and is much faster than solving a Linear 
Programming Relaxation (LPR), which can be degraded to an MCNF 
problem via the following procedure. 

On the same graph topology GiQ.E), the capacity constraint along a link 
is set to a instead of 1, and a flow of size 1 + a is launched by node s to the 
network. The objective of the algorithm is thus to find flows with minimum 
^^e'fe ’ '''here f^=x^ + represents the flow on arc ee E . In other 

words, the constraint in Eq. (8.1) is replaced by the follows: 

For all nodes isV , 



1 if / == source node S 

N N 

X yij ) ~ X ^ ^ “ destination node D 

0 otherwise 

Although single flow relaxation yields much better computation 
efficiency, it can easily yield a split-flow problem which invalidates the 
derived results. An example is shown in Fig. 8.5(a) and Fig. 8.5(b). There 
exists a large number of fast Minimum Cost Network Flow (MCNF) 
implementations with time complexity of 6>(|£|-logjj^|g|^^) A^), such as the 

Push-Relabel method [5] and the Network optimizer CPLEX, where n and m 
are the number of nodes and links in the network. The problem can also be 
solved by any other available LP solver, such as LP-SOLVE. 
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Fig. 8.5. In the Single Flow Relaxation, the constraint of integer flows is 
relaxed and may yield invalid results: (a) the case with valid results, in 
which the flows do not split, and (b) the case with invalid results, in which 
the flows split at node nl. 

An enhancement to the SFR is presented by J. Tapolcai et al [6], which 
makes use of Suurballe’s algorithm. The following theorem verifies that SFR 
can be entirely substituted by Suurballe’s algorithm since both of them yield 
exactly the same result, while Suurballe’s algorithm has a much better time 
complexity than that of SFR. 

To show the enhanced SFR, the following lemma was proved by Suurballe 
in reference [7]. 

Lemma 1. In implementing Suurballe’s algorithm, use d"{i) = d{f) + d\i) 
as the node potential to modify the cost of each arc (the reduced arc cost is 
denoted by: 

+d'ii)-d\j)^c,,^j, +id'(i)+d(i))-id'U)+dU)), 

then the arcs of the disjoint paths would have negative or zero reduced cost, 
while all the other arcs would have non-negative reduced cost. 

An example for the enhanced SFR is shown in Fig. 8.6, in which each 
node is relabeled with the reduced arc cost and node potential. With the 
reduced graph, using SFR can simply derive the optimal link-disjoint path- 
pair. The following theorem states this relationship. 
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Fig. 8.6. A relabeling on the graph shown in Fig. 8.4 according Lemma 1. 

Theorem 1. SFR can be substituted by Suurballe’s algorithm. 

Proof: SFR try to find a minimum cost network flow of size a + 1 between 
the source S and the destination D, while the capacity of each edge is set to 
a . To get the optimal node-disjoint path-pair between S and D, the 
following criteria must be held: 

The flow / gives an optimal answer if and only if there exists a node 
potential function d such that d{S) = 0, and 



-d(i) + d(j) < 0 
=> = capacity{e{i j)) 



where / is the flow indicator of an arc, which equals the size of the flow 
passing through the arc, and capacity{arc) is a function which returns the 
capacity passing through the arc. 

It is clear that with the node potential relabeled using the approach stated 
in Lemma 1, an optimal node-disjoint path-pair can be derived as the size of 
the flow launched in the network is a + l. Let denote the shorter path of 
the path-pair derived by using Suurballe’s algorithm, and let Pp denote the 
longer one. The following observations can be made: 

1. If there are no common arcs along SP and SP’, then a flow of size a is 
assigned to SP and a flow of size 1 is assigned to SP’. In this case an 
optimal solution can be derived by SFR. 

2. If the total cost of the disjoint path-pair given by Suurballe’s algorithm 
equals the cost of SP (i.e., cost(SP) = ^/(t) = cost(P„) + cost(Pp)), then a 
flow of size a is assigned to Pw and 1 to Pp. Note that in this case 
cost(SP) = cost(P,v) = cost(Pp), so that we can get the optimal solution by 
SFR. 

3. If common arcs exist between SP and SP’, then a flow of size a -1 is 
assigned to the common arcs (i.e., SP n SP’), a is assigned to the rest of 
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Pw, and 1 is assigned to rest of Pp. In this case the optimality criteria are 
satisfied, and there is a flow of a +1 between the source and destination. 
Using the results of Theorem 1, an enhancement of SFR (ESFR) can be 
defined which makes use of Suurballe’s algorithm instead of MCNF, and is 
stated as follows. 

Step 1: Conduct Suurballe’s algorithm on the graph G(Q ,£). 

Step 2: If there is no common link, or if the cost of each path equals the cost 
of the shortest path, then an optimal solution for all at \ can be 
derived (i.e., Py„ = SP, and Pp = SP’). 

Step 3: Run ILP to get an optimal solution if both Step 1 and Step 2 fail. 

ESFR can find 75~80% of the S-D pairs of the networks illustrated in 
Fig. 8. 10(a) ~ (d) without resorting to the time-consuming ELP process. 

8.2.4.2 Linear Programming Relaxation (LPR) 

We now consider the approach of Linear Programming Relaxation (LPR), 
in which the constraint in Eq. (8.2) is relaxed. A new constraint is imposed 
instead, namely 0 < ^ 1 and 0 ^ ;c^ ^ 1 for all links ee E. This approach is 

also known as the minimum cost two-commodity flow problem, which has a 
polynomial complexity in contrast to the Integer Linear Programming 
problem. In spite of its computation efficiency, LPR allows firactional flows, 
which yields invalid results. Using LPR to solve the node-disjoint path-pairs 
gives an optimal solution sometimes, while it also suffers from the split-flow 
problem. The fact that LPR yields a valid solution means that an integer 
solution (or two-path solution) is generated. In the following paragraphs, two 
types of split-flow problems are introduced, which are called reparable or 
unreparable split-flow problems respectively. We will also present an 
approach for solving the split-flow problem stated in [6]. 



8.2.4.2.1 Reparable Split 

If more than one optimal path exists between source and destination, the 
flows in the network may be split due to the essential characteristic of the LP 
solvers. In the example shown in Fig. 8.7(a), we assume that the cost of each 
link is 1, and that the total amount of flow from S to D is one. The numbers 
specified on the link represent the percentage of flow. In this case, both the 
working path (solid lines) and the protection path (dashed lines) are split 
even though they could also take single paths with the same minimum cost. 
Although this situation does not occur frequently to a practical network (i.e., 
having more than one path of equal cost between a node pair), it may ruin 
the whole computation process once it appears. There are two ways to solve 
this type of split-flow problem First, we can search for a basic solution in 
LP. Second, we can use Tiny Additional Costs (TAC), where random tiny 
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offsets are added to the cost of all links to distinguish the total cost of all 
possible paths. 

S.2.4.2.2 Unreparable Split 

Fig. 8.7(b) shows another type of split-flow problem. The working and 
protection paths are drawn as solid and dashed lines, respectively. The links 
5 -» 1 and 12 -> Z) have a cost of 5, and all the other links have a cost of 1. 
In this case, since both of the split flows for the two paths have a minimum 
total cost, it cannot be repaired by using TAG. Since the variation of the 
value of a may change the flows from split to non-split, or from non-split to 
split, an approximation can be made by slightly shifting the value of a and 
then recalculating. Based on this observation, a heuristic called “a -shifting” 
is presented below to approximate a near-optimal solution. 

The basic idea of a -shifting is to search a' that is closet to a , with which 
the relaxation process works with un-split or at least reparable split flows. In 
other words, a sub-optimal solution is derived as a tradeoff with computation 
complexity. An analysis is conducted on the cost of the path-pairs with 
different values of a . Let (5, D, a) denote the total cost of an optimal 

solution regarding a specific value of a , and fu,g(S,D,a) be the total cost 
derived by using the LPR process. It is clear that the plots 
for and fjj,^{S,D,a) against a are continuous, monotonously 

increasing, piecewise-linear, and concave. The piecewise linearity of the 
functions comes from the fact that with different values of a , a 
corresponding optimal solution can be yielded. As a result, the whole 
function is composed of one or more linear pieces, each of which represents 
a path-pair solution. Obviously, (5, £),«)< fu,g(S,D,a) since the flows 

are split to yield a smaller total cost, and the equality holds if the LPR 
process gives an integer solution. 

The heuristic a -shifting is described as follows: 

Step 1: Run the dual relaxation of LPR using the weighting parameter a . 

Step 2: If there is no unreparable split flow, then an optimal solution is 
found; stop. 

Step 3: Calculate the allowable increase and decrease A” of the variable 
a from the sensitivity report given by the LP solver. Initialize the 
following two variables: (/:=a+ A* and or' := or - A" . 

Step 4: Run the dual relaxation of LPR at a' and a " , which gives results by 
and P' , P' , respectively. 
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Step 5: If both LPR processes at a' and a" give integer solution (i.e., no 
unreparable split flow in the solution), the near-optimal solution is the 
cheaper one between a • cost( + cost( ) and a • cost( Pj ) + 

cost(P'). Stop. 

Step 6: If one of the LPR processes gives an integer solution, stop (a valid 
solution is derived). 

Step 7: If none of the LPR processes yields an integer solution, a new a" 
from the sensitivity report of a' (i.e., a" := a' - A" ) is tried. Otherwise, 
repeat Step 7. 

Note that in the algorithm presented above, when both a' and a" cannot 
yield an integer solution, only a new a 'is generated instead of generating 
both a' and a' . As a" is decreased to 1, the problem is degraded to a 
symmetrically weighted node-disjoint path-pair searching problem, and can 
be solved with Suurballe’s algorithm. The following flowchart shown in Fig. 
8.8 demonstrates a -shifting procedure. 
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(b) 



Fig. 8.7. Examples of the two types of split flow cases: (a) The case of 
reparable split flow, in which more than one optimal path exist in the 
network, and (b) the case of unreparable split flow, in which each flow has a 
minimum total cost and cannot be repaired in a polynomial-complexity 
computation process. 
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Fig. 8.8. Flowchart of the heuristic “ ^ -shifting” for solving the 
asymmetrically weighted node-disjoint path-pair problem. 

8.3 Diverse Routing for Shared Protection 

The algorithms described in Section 8.2 use the same link-state metric 
(e.g., hop count) to evaluate the optimality of a path-pair, which can only be 
applicable in the case of dedicated protection. To perform diverse routing for 
shared protection, the routing algorithm has to consider that working and 
protection paths should take different link-state metrics and routing 
constraints. Thus, a working path must be determined before the 
corresponding protection path can be derived. This section introduces an 
approach to dynamically solving a disjointed working and protection path- 
pair for a connection request according to current link-state and topology 
information. 

83.1 Cost Functions 

With the cost function a C+CP, the weight a is an estimator of the 
effect from the shared protection resources, with which working and 
protection paths are greedily selected under homogeneous network states 
(e.g., hop count). This section presents an algorithm based on the k-shortest 
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paths ranking algorithm for solving the asymmetrically weighted disjoint 
path-pair problem, which guarantees to yield an optimal solution for any 
value of a . The value of a should be determined by the protection type 
adopted. For example, if a dedicated protection is adopted (e.g., 1+1), the 
best value of a could be as small as unity, since no difference in the 
consumption of network resources exists between a working path and a 
protection path. For a shared protection scheme (e.g., 1:N or M:N), the best 
value of a should be large since the protection resources are shared by 
several working paths (or, in other words, the protection paths are less 
important than the working paths by a times). In this case, the network 
topology and traffic loads put a heavy influence on shareability, which 
makes the best value of a different from one case to another. 

The wavelength assignment scheme for a protection path can simply be 
the First-Fit scheme: the first wavelength plane that is available for 
protection, no matter the wavelength channel is, a free channel or a sharable 
spare capacity. Considering that protection resources can be shared by 
multiple protection paths, the value of a should be larger than 1. Fig. 8.9 
shows a flowchart for the survivable routing process. 




Fig. 8.9. Flowchart for deriving node-disjoint path-pairs considering the 
SRLG constraint. This flowchart differs from Fig. 8.3 by the bold-lined 
rectangle, where the protection path must be derived based on the SRLG 
constraint imposed by its working path. 
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In addition to using a to estimate the effect of shared protection, an 
alternative is to define another cost function C + CP’, where C = ^c,. , and 



(oo if the channel is not reservable 
otherwise 



CP’ is defined as CP’= , and 

i 



f oo if the channel is not reservable 



cpi=\l 



if an extra wavelength channel is needed 



[0 if no extra wavelength channel is needed 



( 8 . 10 ) 



The following paragraphs introduce an approach for finding the topology 
link-state for deriving a best protection path corresponding to a working path 
as defined in Eq. (8.10). 



8.3.2 Derivation of Protection Paths 

To solve for a protection corresponding to a working path, we need to 
find the link-state before invoking Dijkstra’s algorithm. The topology link- 
state for calculating working paths can be easily derived from the link-state 
database in each source node, which provides the per-wavelength 
availability of each link to the routing algorithm. Eq. (8.9) defines the link 
cost for solving working paths. The derivation of link-state for protection 
paths is much more complex since the resource sharing and diversity 
constraints have to be considered. To be more precise, with a specific 
working path we need to mark each link with either «> , 1, or 0 as defined in 
Eq. (8.10) in order to form a new link-state for solving the corresponding 
protection paths. 

The following algorithm is introduced to derive the link-state for finding 
a protection path corresponding to a given working path. When a new 
connection request arrives, the working path is routed first, which is termed 
W. We assume that the network contains link set L, and W contains link set 
Ly/\ the original spare capacity along each link is v, which is a 1 x|l| integer 
vector recording the spare capacity along each link. L(/] is denoted as \htjth 
link in the network. ivD^, is denoted as a working path set which contains 
working paths traversing through link / in common with W, and [n] is 
the n-th working path in the path set. The pseudo-code returns the cost of 
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each link, which can be used for solving the optimal protection path with 
Dijkstra’s algorithm corresponding to W, 

Initiate: 

s[] <—0; 

/* s[ ] is a 1x|l| integer vector which records the amount of spare capacity on 
each link consumed by the protection paths whose working paths have at least 
one common link with W, This spare capacity is subject to the SRLG constraint, 
and may not be sharable with the protection path of W. The spare capacity 
along link I is not sharable by the protection path of W if the amount of spare 
capacity is no more than ^[r]. In this case, W has to be protected by allocating 
an extra unit of spare capacity along link I if its protection path passes through 
the link; */ 

L5[ ]; /* LS[ ] is a 1x|l| vector which stores the cost of each link, and can be 
either 0, 1 or ^ ; */ 

For (/«-! to \l^\) 

<— all the working paths passing through link Lw[l] except W\ 

/* Le., all the other working paths which pass through Ly/[l] and share the 
same risk of a single failure on Lw[l] with W; */ 

For(7f~l to |l|) 
temp_S <— 0; 

/* tempos is used to store the amount of spare capacity on the mth link to 
protect working paths in NW; 

For ( n ^ 1 to |WD^| ) /* for every working path in ; */ 

If ( [n] has its protection path passing through L\j]) 
tempos temp_S + 1 ; 

/* tempos is the minimum amount of spare capacity on L\j] needed 
to protect the working capacity on link Lw[l], which is the 
bandwidth sum of all the protection paths traversing through 
L[/] whose working paths pass through Lw[/]; we need to sum up 
the spare capacity demand because of the SRLG constraint; *1 
End If; 

End For; 

If (tempos > ^[/]) 
sV] <—temp_S\ 

End If 

/* record the minimum amount of spare capacity needed by the jth link\ 
*/ 

End If , 

End For; 
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/* the following pseudo code determines the cost of each link, which is an 
implementation of Eq, (2); */ 

For(;^lto|L|) 

lHs\j]==v\j]) 

/* which means that the spare capacity along the jth link is fully required 
by all working capacity along the l-th link; therefore, the algorithm has 
to reserve an extra unit of spare capacity for protecting W if the 
corresponding protection path traverses through the jth link; Note that 
j[/] is always equal or less than v(/] since the existing working paths are 
supposed to be well protected; */ 

If (the jth link has bandwidth not reserved by any path) 

LS\j]<r-l; 

ElseLS[/]<-oo; 

/* i.e., the jth link does not have any free capacity; *1 

End If 

Else ZrtS'lj] ^ — Oj 

End If; 

End For; 

End For; 

Return LS; 

Fig. 8.10 illustrates the symbols used in the pseudo-code. There are three 
working paths (i.e., WD^[1], VFD^[2]and WDi,[3]) in the illustration with at 

least one link in common with the newly arrived working path W. The 
pseudo code inspects iteratively for each link traversed by W (i.e., Lw[fl). 
The minimum demand for spare capacity at link L\j\ to protect the three 
working paths is the maximum of the spare capacity demand among the 
working capacity of each link taken by W. 
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Fig. 8.10. An illustration of the symbols used in the pseudo code. 

In the above pseudo-code, the difference between s and v is the amount 
of spare capacity that can be shared by the protection path of W. This is also 
the sharable spare capacity which can be taken for no cost. With the vector 
LS, Dijkstra’s algorithm has full knowledge of which link has sharable spare 
capacity and which does not, and the best protection path can be solved upon 
the re-configured link-state topology using the approach. 

8.4 Dynamic Allocation of Protection Domains 

We have enumerated the advantages of using the Short Leap Shared 
Protection (SLSP) framework in optical networks in Chapter 6, which 
include scalability, adaptability and generalization. This section presents a 
dynamic survivable routing scheme based on the framework of SLSP. We 
will focus on the implementation issues and performance evaluation in the 
dynamical allocation of protection domains for each connection request. 

The merits of SLSP are clear; however, achieving a scalable and 
capacity-efficient allocation of P domains between an S-D pair is difficult 
since the number of feasible allocations has an upper boxmd exponentially 
growing with the distance between source and destination. To solve this 
problem we introduce a heuristic algorithm formulated to perform a diverse 
routing on a series of chained PSL-PML pairs. The PSL-PML pairs can be 
derived by selecting nodes along an alternate path coimecting each S-D pair. 
The problem is sub-divided and solved in the four steps listed below: 

1. Pre-plan several paths connecting each S-D pair as alternate paths in the 
network. 

2. Define a series of intersected PSL-PML pairs along each alternate path 
with a fixed distance. Step (1) and (2) are planned off-line. 
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3. As a connection request arrives, perform a constraint-based diverse 
routing between the PSL-PML pairs along each alternate path to yield 
working and protection paths in each P domain. This process can iterate 
either upon all the alternate paths or until an allocation is found to meet 
the pre-defined requirements. 

4. Select the combination of P domains with the least cost. 

A slight modification is made to the FTSA algorithm introduced in 
Section 8.1 in that a working path is routed between PML(i-l) and PSL(i+l) 
while the corresponding protection path is routed between PSLi and PMLi as 
shown in Fig. 8.11. To perform path selection in a distributed manner, every 
potential source node has to keep track of the distribution of all working and 
protection paths in the network. PSLs along an alternate path behave as 
source nodes that activate the diverse routing subroutine locally, which 
makes the diverse routing process more time-efficient. 

As a connection request arrives, the alternate paths corresponding to the 
request are inspected one after the other to find a qualified/optimal allocation 
of P domains. The allocation of P domains between PSLi and PMLi can be 
formulated as a diverse routing problem as shown in Fig. 8.1 1, where 1 < i < 
Im, and /„ is the number of P domains along the mth alternate path. The 
working and protection paths are diversely routed between PSLi and PMLi 
with the following constraints: 

1. PML(i-l) and PSL(i+l) have to be traversed by the working path. 

2. A protection path has to follow the SRLG constraint based on the 
corresponding working path segment. 

3. Working and protection paths are in the same wavelength plane if no 
wavelength conversion is allowed. 

4. The other customized constraints, such as the upper bound on the size of 
the P domains. 




Fig. 8.11. The diverse routing process is performed on the ith P domain 
on the kth wavelength plane if the wavelength charmels between PSLi and 
PML(i-l ), and between PSL(i-l ) and PSL(i+l), are available. 
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The results of the diverse routing between PML(i-l) and PSL(i+l), along 
with links PSLi - PML(i-l) and PSL(i+l) - PMLi, constitute the ith P 
domain. Note that the working path selected in the ith P domain does not 
necessarily coincide with the segment between PML(i-l) and PSL(i+l) of 
the pre-planned path, since dynamic traffic and link states are not considered 
along the alternate paths. 

We do not consider wavelength conversion because networks with partial 
wavelength conversion capability can be modeled as wavelength graphs to 
form a single-layered graph via the approach demonstrated in Chapter 4. 
However, networks without any wavelength conversion capability can be 
treated as independent graphs for each wavelength plane. Therefore, without 
loss of generality, we assmne in the following analysis that there is only a 
single wavelength plane existing in the network. 

We define the working path segment in the ith P domain as the path 
segment starting from PSLi to PSL(i+l) if i < and proceeding from PSLi 
to PMLi if i = /„. We also define the cost function Cf^ = CW” + CP^" , where 

CW” and CP"^^ the cost for working and protection paths in the ith P 
domain along the mth alternate path, respectively; andcj^.^is the cost for the 

ith P domain along the mth alternate path. The cost for working path W to 
reserve a wavelength channel (i.e., the jth link) by a working path segment in 
the ith P domain along the mth alternate path is defined in Eq. (8.11): 



CW. . = 

t,j 



oo if the channel is not reservable 
1 otherwise 



( 8 . 11 ) 



Give the working path W, the cost of reserving a wavelength channel 
along the jth link as spare capacity by a protection path segment in the ith P 
domain along the mth alternate path is defined in Eq. (8. 12) 

00 if the channel is not reservable 

1 if an extra wavelength channel is needed ^2) 

0 if no extra wavelength channel is needed 

and CP"’'^ are defined as follows: 



^Pij = 



CW” = 

j 



(8.13) 
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cpr=lcprf 



Therefore, we get the cost of all working and protection paths in each 
domain: 




Fig. 8.12 is a flowchart demonstrating the iteration of the path selection 
process. Each path in the K-shortest path set between PML(i-l) and 
PSL(i+l) is taken as a candidate working path and inspected in every 
iteration for all i. Note that PML(i-l) and PSL(i+l) for all i have been pre- 
assigned by the source node. For each candidate working path between 
PML(i-l) and PSL(i+l), the algorithm iteratively finds the corresponding 
protection path using ITS A. The ending criterion can be (1) the time spent, 
(2) the number of candidate working paths inspected, or (3) an inequality 
such as C,^orded+CPrecorded^C^em+3 , WhcrC Crecorded and iOit the 

costs of the working and protection paths recorded, Ccunem is the cost of the 
currently examined working path, and 6 is the tolerance of the optimality. 
The optimal solution can be obtained if The idea 

of criterion (3) is exemplified in Fig. 8.13. In case the recorded working and 
protection path yields a cost sum of 5 (i.e., = 5 ), the 

algorithm does not have to inspect the working path with a cost larger than 5 
since there will not be any path-pair with a cost sum less than or equal to 5 if 
longer working paths are inspected. The adoption of tolerance S makes the 
iteration stop after the derivation of a W-P path-pair with cost sum (5 + ^ ). 
In the simulation conducted in Section 8.4.1, criterion (2) and (3) are 
adopted, in which the decision is made after 20 working paths are examined 
with a tolerance of 20% cost of the recorded working path. 

To summarize the algorithm of P domain allocation, the flowchart in Fig. 
8.14 shows the iterative process for deriving the shared W-P path-pair and 
the Cascaded Diverse Routing. M is the number of candidate working paths 
prepared for each node-pair. Note that the links between PSLi and PML(i-l) 
are equivalent to the links between the (D i)th node and (D i + l)th node, 
where D is the diameter of a P domain and f = 1 to 1. 
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Fig. 8.12. Flowchart for deriving a protection path connecting PSLi and 
PMLi in the ith P domain. 

recorded 
^recorded 

^current 

5tof 



Fig. 8.13. An example of criterion (3). The recorded working path has a 
cost of 3 and the protection path has a cost of 2. The iteration comes to an 
end after a working path with a cost of 5 is inspected. In this case, the total 
cost can never be improved by inspecting more candidate working paths 
even if a zero-cost protection path is derived. 
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Fig. 8.14. Flowchart of the iterative process for deriving the cascaded 
shared W-P path-pairs in the P domains along alternate paths. 

8.4.1 Simulations 

The following simulation examines network performance in terms of 
blocking rate of building up working paths with end-to-end protection 
mechanism (either path-based, link-based, or SLSPn, where n > 2) between 
any node pair. The notation “SLSPn” is defined as the experiments on SLSP 
with each domain diameter of n hops. 

8.4.1.1 Assumptions 

We examine the protection strategies listed above in 22-node, 30-node, 
79-node and 100-node networks. Without loss of generality, each connection 
is assumed to be a bi-directional path with a uniform traffic volume between 
each node pair, which has to be equipped with an end-to-end shared 
protection service. 

The networks are assumed to have two fibers between adjacent nodes in 
each direction, which are contained in a conduit. Each fiber contains 16 
wavelength channels with the same bandwidth. Each connection request is 
an establishment of a lightpath between two nodes, which arrives according 
to a Poisson process and departs after a period of time defined by an 
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exponential distribution function. All the lightpaths in a conduit are in the 
same SRLG (i.e., they share the same risk of failure). The adoption of 
network resources for protection succumbs to the restriction of wavelength 
continuity; in other words, no wavelength conversion is allowed in the 
simulation. 

A blocking is counted if a working lightpath cannot be established with 
an end-to-end protection. After the connection is terminated, all the network 
resources reserved by the connection are released. In this simulation, each 
trail has 10,000 connection requests under a load condition set by each S-D 
pair. Data is derived by averaging the results of 4 trials. We assume every 
node has a potential traffic demand to all the other nodes modeled in arrival 
and departure rates. 

8.4.1.2 Simulation Results 

Fig. 8.15 shows simulation results in terms of blocking probability under 
different traffic load and protection strategies. The path-based protection 
yields the worst capacity efficiency, and is outperformed by the SLSP with 
proper sizes of P domain. Table 8.1 below shows the topology information 
of each network and a summary of the simulation results. For the small 22- 
node and 30-node networks in which the average distances between each S- 
D pair are 2.49 and 2.71, the best performance in capacity efficiency is 
achieved by using SLSP2 (i.e., the link-based protection defined in this 
study). For the 79-node and 100-node networks in which the average 
distances between each S-D pair are 6.57 and 9.5, the best capacity 
efficiency is achieved by using SLSP3 and SLSP4, respectively. 

We can clearly see from Fig. 8.18 that the SLSP2 outperforms most of 
the other schemes, especially in smaller networks, because more resource 
sharing has been achieved. However, better sharing of network resources 
does not guarantee better performance all of the time since the SLSP2 
intrinsically takes more protection links than the other schemes. The above 
observations explain why the best performance is yielded in the four 
networks with SLSP of different diameters, as shown in Table 8.1. The 
diameter of P domains determines the performance of networks with 
different sizes, by compromising the effects of the resource sharing and the 
number of protection links taken by a working path. Note that under the 
wavelength continuity constraint, only a multi-fiber network can benefit 
from the relaxation of the SRLG constraint because it has been assumed that 
the working and protection paths must be in the same wavelength plane. In a 
single fiber network, two working lightpaths traversing the same physical 
optical network element can never share the same protection wavelength 
channel, since they need to be on different wavelength planes. 





Blocking Prob. vs. Ijoad (22-node) 



Brocking Prob. vs. Load (30-node) 



100 . 00 % 



10 . 00 % 



Blocking Prob. vs. Load (79-node) 



10 . 00 % 



Blocking Prob. vs. Load (100-node) 



100.00P^ 



-I-SLSP2 

-i^SLSPS 

-^SLSPB 

-SeSLSPS 

-e-Pastvbased 



10 . 00 % 



Fig. 8.15. Simulation results on the SLSP for the blocking probability 
with different traffic load and protection strategies. 
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No. 

Node 


No. 
of link 


Avg. nodal 
degree 


Avg. 

distance 


Best 

diameter 


Size of 
P domain 


22 


44 


4.0 


2.49 


2 


4.6 


30 


63 


4.2 


2.71 


2 


4.5 


79 


108 


2.73 


6.57 


3 


10.3 


100 


179 


3.58 


9.5 


4 (or 5) 


16.9 



Table 8.1. Summary on the topological information and the diameters of 
P domains that achieves the best efficiency. 

As shown in Chapter 4, the S-D pairs with relatively long distance apart 
will encounter a larger probability of blocking, and may impair performance. 
This is because not only a working path needs to be located, but also the 
corresponding protection path has to be found before a successful allocation 
can be announced. This means we need a more stringent criterion for the 
allocation of connection requests across S-D pairs with relatively larger hop 
count apart. An example is given in the simulation results for the 22-node 
network shown in Table 8.1. The SLSP4 outperforms the path-based scheme 
by 6% to 7% in blocking probability under all provisioning traffic loads. 
According to our data, between 76% (when traffic is heavy) and 81% (when 
traffic is light) of the connection requests have working paths equal to or less 
than 4 hops. The rest of the working lightpaths take more than 4 hops, which 
yields the difference between the performance of the path-based scheme and 
the SLSP4. One can see that the connection requests with working lightpaths 
longer than 4 hops can hardly be successfully allocated with path-based 
protection. The segmentation of the long working lightpaths improves this 
situation, and yields a much better performance in blocking rate. 

Fig. 8.16 shows the computation complexity versus the size of P domains 
to demonstrate that SLSP takes advantages of local computation with 
guaranteed efficiency, and cooperate with the diverse routing algorithm 
adopted. The y-axis is the number of iterations needed for completing 
diverse routing within a P domain, which dominates the computation time. 
The x-axis is the normalized P domain diameter relative to the average 
distance of each network. It is shown that, with the derived optimal P 
domain diameter, the average number of iterations for deriving the working 
and protection path-pairs can be close to 1, which means that the working 
path is the optimal path between the PSL and PML in each P domain most of 
the time. Segmenting working paths for each connection request can largely 
decrease the computation complexity. 
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Computation Complexity vs. P domain Size 



0.27 0.55 0.82 1.09 1.37 1.64 1.91 2.19 

Normalized P Domain Diameter with the Average 
Distance of the Network 



Fig, 8,16, Computation complexity versus the normalized size of P 
domains in the networks, 

8.5 Summary 

This chapter discussed dynamic survivable routing schemes for both 
dedicated and shared protection. A number of approaches have been 
presented for solving the problem of finding asymmetrically weighted node- 
disjoint path-pairs. For dedicated protection, working and protection paths 
can be solved upon the same link-state and constraint since both of the paths 
reserve free capacity along a link. The approaches presented in this chapter 
to solve optimal or approximating optimal solution of node-disjoint path- 
pairs were based on k-shortest paths, Suurballe’s algorithm. Single Flow 
Relaxation, Integer Linear Programming, and Integer Linear Programming 
Relaxation. For shared protection, as the selection of working and protection 
paths is subject to different constraints, a successive derivation of the two 
paths is necessary. We introduced approaches based on k-shortest paths, 
called Iterative Two-Step-Approach, which focused on the performance 
evaluation of the cases with different values of a . The simulation results 
show that with a properly designed weight, a can successfully estimate the 
effect of resource sharing on the network by using the cost function 
aC + CP. 

This chapter also demonstrated a particular implementation of dynamic 
survivable routing, which solved the problem of allocating protection 
domains under the Short Leap Shared Protection (SLSP) framework. An on- 
line algorithm called Cascaded Diverse Routing (CDR) was provided to find 
optimal working and protection paths for each P domain. A simulation-based 
comparative study was conducted on the dynamic protection domain 
allocation scheme and the traditional shared protection schemes. The 
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simulation results show that SLSP, with properly assigned domain 
diameters, outperforms path-based and link-based protection in capacity 
efficiency. The best diameter for achieving optimal network performance 
depends on the size and topology of the network. We conclude that the 
dynamic protection domain allocation scheme generalizes the traditional 
path-based and link-based protection, and can adapt to a changing network 
environment with strict restoration requirements. 
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Questions: 

1. Explain why we put different weighting on the cost of working and 
protection paths. Why do we need to use heuristics to solve the diverse 
routing problem with asymmetrically weighted working and protection 
path-pairs? 

2. Describe Suurballe’ s algorithm with illustrations. 

3. Can Suurballe’s algorithm solve the diverse routing problem in shared 
protection? Explain your answer. 

4. What is the computation complexity when using Suurballe’s algorithm 
for solving link-disjoint path-pairs in a network with N nodes and M 
links, and average nodal degree D? What is the computation complexity 
if a node-disjoint path-pair is desired? 

5. What are the two major disadvantages of using the Two-Step-Approach 
in solving the diverse routing problem? 
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6. Why do we sometimes need to put weight on the cost of working path 
when solving a diverse routing problem for dedicated protection? 

7. Linear Programming Relaxation (LPR) can improve the computation 
complexity of Linear Integer Programming, but it also introduces some 
problems. What are the problems with using LPR? What constraint does 
the Linear Programming Relaxation relax as shown in this chapter? 

8. What is the physical meaning of flow split? Why can we not allow a flow 
to split in WDM networks? 

9. Why do we need to take two different topological link-states to calculate 
working and protection paths separately in shared protection? 

10. Consider the network topology shown in Fig. Q. 10 and the distribution of 
working and spare capacity listed in Table Q.IO. 

(a) Use the matrix approach to derive the minimum amount of spare 
capacity required along each link to achieve 100% restorability. Write 
the working and protection path-link incidence matrix and the spare 
provision matrix. 




link 1 
link 2 
links 
link 4 
link 5 
link 6 
link 7 
link 8 
link 9 
linklO 
link 11 
link 12 
link 13 



Fig.Q.lO. 



Working 

Paths 


Protection 

Paths 


A-B-C-D 


A-E-F-G-H-D 


B-F-J 


B-C-G-J 


B-A-E 


B-F-E 


E-J 


E-F-J 


H-G-J 


H-D-C-B-F-J 




''able Q.IO. 
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(b) When a connection request across the S-D pair (B-H) for a working- 
protection path-pair arrives, the selection of a working path is subject 
to several alternatives as shown in the following table. Complete this 
table. 





Working 

Paths 


Overlapped 
Working Paths 


Link cost (from linkl 
to 13) 


1 


B-C-D-H 


W1 


[1,00,00,00,1, 0, 0, 0, 
1, 1, 1.0,0] 


2 


B-F-G-H 


? 


? 


3 


B-F-J-G-H 


? 


? 



(c) According to the link cost derived in (b), solve the best protection path 
for each of the working paths in the above Table. 

(d) If every link has a cost 1, which one is the best working-protection 
path-pair in this case? 

11. With the pseudo code in Section 8.3.2, the topology link-state for solving 
the protection path corresponding to a working path can be derived. 
When we are allocating a protection path for working path W, do you 
agree that the minimum amount of spare capacity along a specific link is 
the bandwidth sum of all working paths that share risk of single failure 
with W? Explain your answer. 

12. We are given the network topology shown in Fig. Q.IO and the working 
paths listed in Table Q.12. The cost functions for solving working and 
protection paths shown in Eq. 8.10 are modified as follows. If a new 
connection request for a working and protection path-pair arrives across 
node E and node C, find the link-disjointed path-pair subject to the least 
total cost. 

Joo if link j is not reservable 
[2 otherwise 

2 if extra unit of spare capacity is required along link j 
1 if link i is shared by one protection path 

■< 

0 if link j is shared by more than one protection paths 
eo link j is neither sharable nor with enough free bandwidth 



CWj 



cpj 
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Working 

Paths 


Protection 

Paths 


A-B-C-D 


a-e-f-g-h-d 


B-F-J 


B-C-G-J 


B-A-E 


B-F-E 


E-J 


E-F-J 


H-G-J 


H-D-C-B-F-J 


E-F-G-H 


E-A-B-C-D-H 


F-G-H-D 


F-B-C-D 




Table Q.12. 



13. Fig. Q.13 shows a fraction of a network. The working paths traversing 
through at least one link of the network fraction are: W1 (A-B), W2 (E- 
C-D), and W3 (A-B-C-F). Now we attempt to allocate a new working 
path W along A-B-C-D as shown in Fig. Q.12. Let the bandwidth of Wl, 
W2, and W3 is Bl, B2, and B3, respectively. We are committed to 
finding the capacity distribution along the link G-H (or link j). 

(a) To restore any single failure possibly occurring to link A-B, how 
much spare capacity do we need to allocate along link jl (Hint: Use a 

binary indicator, Sj, , which is “1” if the protection path of path p 
traverses link j) 

(b) Continue (a), solve the minimum amount of spare capacity required to 
be allocated along link j when the working capacity along link B-C 
and C-D is considered, respectively. 

(c) We symbolize the minimum demand of spare capacity by the working 
capacity along link I upon link j as SB/ . Write down a general 

expression for deriving SB / . 

(d) What is 5^ , which is the minimum prohibited spare capacity to be 
reserved along link j for achieving 100% restorability given the new 
working path W? Write down a general expression for S'^ . 




Chapter 9 



OPTICAL BURST SWITCHING 



In the previous chapters, we have discussed a variety of topics on 
wavelength-routed networks. A lightpath is physically set up through a suite 
of signaling mechanisms by way of dedicated control channels before optical 
data is launched, and is tom down after the connection ends. As network 
traffic gets more dynamic and bursty, wavelength-routed networks suffer a 
fatal drawback in achievable throughput. The major problem with 
wavelength-routed networks lies in the fact that the reserved channels are 
held regardless of whether or not effective data is transmitted. We need to 
have a new switching type to achieve an ultra-high throughput and real-time 
provisioning of data transmission. 

Optical Burst Switching (OBS) architecture aimed improve both network 
throughput and provisioning speed. OBS cooperates with a suite of 
distributed signaling mechanisms, and is devised to support real-time 
multimedia traffic on the Internet. It provides solutions to network control 
with ultra-dynamic traffic, and a transition between circuit- and packet- 
switching architectures in the optical domain. Since the traffic pattern is 
often self-similar or bursty, OBS is especially efficient in providing high-end 
users or applications with high bit-rate, low latency and short duration 
sessions. The expense of the high throughput and dynamics is a result of 
high complexity in signaling mechanisms, where precision in time 
measurement for fast configuring of optical elements is required. 

This chapter explores architecture and implementation issues in OBS, 
and is organized as follows. In Section 9.1, the three basic switching 
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paradigms in the optical domain - wavelength-routed, packet-switching, and 
burst-switching - are introduced. Section 9.2 introduces the two OBS 
signaling and resource reservation protocols: Just-in-time (JIT) and Just- 
enough-time (JET). A detailed discussion and analytical study of connection 
setup and signaling mechanisms then follows. A data-channel scheduling 
algorithm called Horizon (a JTT support), and along with a suite of schemes 
that can differentiate best-effort and QoS data-bursts under the JET structure 
are also presented. Section 9.3 surveys the proposals of node architectures 
for implementing the OBS. Section 9.4 summarizes the chapter and 
compares the three switching paradigms proposed for the transportation 
plane in the optical domain. 

9.1 Introduction to Switching Techniques 

The migration from telephony circuit-switching architecture to packet- 
switching technology is a huge revolution in data communication networks, 
which has been greatly facilitated by advances in hardware-based VLSI 
manufacturing and switching capability. The difference between circuit- 
switching and packet-switching is mainly in the way in which the control 
packet is sent for configuring switch fabrics in the nodes along the pre- 
defined physical path. In the case of circuit-switching, a control packet for 
setting up a circuit is sent before a data-burst is transmitted along a dedicated 
channel (or out-of-band). Once the circuit is set up, no other connection 
sessions can use the reserved resources until the session is finished. In this 
case, the bandwidth is wasted if there is no effective data transmission 
during the session. With packet-switching, however, the control packet with 
a fixed format is sent together with the segmented and variable-length data 
(e.g., an IP packet or ATM cell), which forms a single packet containing a 
header and payload. Nodes along the pre-defined physical route forward the 
packet by inspecting the header of the packet through a hardware-based 
approach. During the header inspection period the payload is buffered in the 
node. In this case, any network resource (e.g., switching fabric, link or 
component) is dedicated to transporting the packet only when the packet is 
traversing across the network resource. Therefore, packet-switching can 
achieve a much higher throughput than circuit-switching, especially when 
the traffic is bursty and self-similar. 

With the emergence of DWDM technology, the amount of data carried 
along single fibers has been increased by thousands of times (i.e., over 10 
Tbps), which puts stringent demands on die switching capacity of IP routers. 
However, when requirements for the switching capacity of IP routers are 
increased, a scalability problem arises when commercially available high- 
speed IP routers (with a bandwidth ranging from 120 to 250 Gbps) are used. 
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Therefore, a significant gap exists between the transmission capacity of 
DWDM fibers and the switching capability of electronic IP routers. In 
general, current IP routers consist of a control box for handling control 
si gnalin g, and an OXC in the bottom transport layer for forwarding traffic. 
The OXCs are attached with link cards as an interface to the input fibers, 
which provide SONET/SDH or point-to-point IP/DWDM interfaces and IP 
switching functions. The most commonly used switch core is in the form of 
multistage interconnection architecture such as Benes network (which can 
construct strict-sense non-blocking Cantor networks), and has been widely 
implemented through guided-wave photonics switching techniques. 

The multistage design of a switch can substantially reduce physical size, 
and enhance the robustness of the terabit IP router. However, the line cards 
will increasingly become a dominant factor in the total cost of the router as 
the router size increases [1]. To overcome this bandwidth gap, we need a 
new switching architecture which can meet the requirement of provisioning 
bandwidth as well as can provide a cost-efficient approach of 
accommodating different types of legacy switching mechanisms (i.e., some 
existing switching types or transmission protocols such as SONET and 10 
Gigabit Ethernet). 

The solution to the above problem lies in the advance of photonic 
technology, by which a switch can be built such that optical IP packets are 
transmitted directly over WDM links on an end-to-end transparent optical 
path without any 0/E or E/0 conversion. In terms of control and 
management, the approach should avoid functional overlaps that may 
emerge when adopting any intermediate layer between IP and WDM layers, 
such as SONET/SDH or ATM. An integrated control plane is necessary to 
ensure that IP packets are directly launched onto the fiber optics. However, 
photonic inspection of the packet headers and buffering of the payloads in 
the optical domain are still not practical at the present stage. Optical 
buffering can be provided by equipping nodes with Fiber Delay Lines 
(FDLs) to a very limited extent (e.g., tens ns) in comparison with any 
electronic buffering, which is far away from being commercialized. The 
processing of packet headers in the optical domain is even more unlikely to 
be feasible in the near future. Therefore, the control of optical switches is 
implemented through electronic processing along with a very limited amount 
of optical buffering for packet payloads. Since optical flows are very fast 
while electronic processing is relatively slow, the overhead for 0/E 
conversion and electronic processing is relatively large. To reduce the header 
inspection duration, several IP packets with the same destination and similar 
dropping policy (or QoS requirements) are grouped and sent together (i.e., 
the switching granularity must be much larger than IP packets to reduce the 
overhead of inspection). This consideration leads to the concept of Optical 
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Burst Switching (OBS), which has been initially reported in [1,2, 3,4, 5]. The 
grooming/assembling of IP packets is termed burstification, which is an 
important functional block in realizing the OBS architecture. A data-burst is 
simply a group of IP packets that are sent along with a single burst header. 

The development of Burst Switching emerged in the early 80’ s [6,7], and 
was devised to initiate a compromise between the characteristics of circuit- 
and packet-switching networks. Circuit-switching is efficient only for data- 
bursts that are much longer than the setup time. Packet-switching has 
negligible setup time, which is achieved by embedding the control 
information into the data-burst. However, in an all-optical WDM switch, the 
header processing must go through 0/E conversion, which is subject to 
much higher cost, and also yields significant latency compared with speedy 
optical data flow [5]. With Optical Packet Switching, a node requires 
significantly large FDLs on a per-wavelength channel basis, so that 
expensive 0\E\0 conversion and electronic buffering for the data payloads 
can be avoided. 

In theory, burst switching is similar to a connectionless mode 
transmission or a packet-switching technique, but it differs from the latter in 
that it implies an intent to establish a switch connection near real-time so that 
only minimum buffering is required at the node switch. The basic idea of 
implementing the burst switching architecture is to reserve the required 
network resources right after the corresponding control packet arrives (i.e., 
Just-In-Time (JIT)) [5], or by a precisely calculated amount of time right 
before the arrival of the data-burst (i.e., Just-Enough-Time, (JET)) [2,3]. 
Therefore, resource utilization can be more efficient at the expense of much 
complexity and high requirements for precision when measuring time 
durations. For this purpose the switching bandwidth of nodes, packet size, 
and the speed of traffic transported in the medium, have to be considered and 
are important to the highly coordinated systems. In other words, the high 
throughput comes at the expense of the requirements of a higher precision in 
time-measurement (or coordination) and protocol robustness for achieving 
the highly value-added burst-switching-based networks. 

The major difference between packet- and burst-switching is that packets 
are sent at a full link speed, while bursts are transmitted only at a TDM 
chaimels speed [6,7]. In addition, the length of data-bursts can be arbitrary 
with a delimiter at the end of the burst. OBS is distinct from the above two 
switch types by taking separate wavelengths to transmit data-bursts (or 
payloads) and their headers to make better use of both mature electronic 
technologies and new optical technologies [1]. In general, the burstification 
scheme can be set such that the size of a burst is exponentially distributed, 
follows predetermined rules for improving network performance (e.g., burst 
dropping probability). 
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The advance in optical communications is illustrated in Fig. 9.1. The 
earliest optical networks were limited to point-to-point transmission or ring 
architecture due to lack of switching devices. However, the use of fiber 
optics in communication systems with WDM technology tremendously 
increased the transmission bandwidth. The realization of mesh networks has 
been supported by the advance of photonic switching technology with ultra- 
high bandwidth, and largely increased the network capacity. As for Optical 
Burst Switching, the very high transmission and switching capacity are 
further accompanied by ultra-high switching speed, where fast and efficient 
traffic provisioning can be achieved. Label Switching is a technology 
migrat^ from the existing IP/MPLS networks, and can be used in control of 
the optical domain because of its clean separation between control and 
transportation planes. Therefore, it can fit into a framework of any switching 
types as an overlay. Optical Packet Switching will be the core technology of 
the next generation Optical Internet, which is, however, still at the 
experimental stage. The OBS is subject to much interest because it serves as 
transitional technology in the gap between state-of-the-art photonic 
technology and future bandwidth and service demands. 
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Fig. 9.1. The advance in optical fiber communications systems. 



To summarize the fundamental characteristics of the OBS paradigm, the 
following descriptions are outlined: 

1. The switching granularity is between circuit- and packet-switching, 
which leads the OBS to take the better of the two switching paradigms 
for WDM networks. 

2. A clean separation is made between control information (header) and 
payload (data-burst), in which different wavelength channels are taken by 
control and transport purposes. 
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3. The lengths of data-bursts are variable depending on the burstification 
schemes. 

4. Buffering is not typically necessary. However, a moderate amount of 
buffering can significantly decrease the burst loss rate and increase the 
flexibility of network resource scheduling. The buffering can be either 
through an O/E/0 conversion or all-optical fiber-delay lines. 

The advantages of OBS, as compared with electrical Burst Switching, are 
as follows: 

1. A significant increase in discrepancy between optical transmission speed 
and electronic switching capability. 

2. Traffic flows can be entirely kept in the optical domain, which improves 
system integrity and saves cost and control complexity. 

We will add details to these advantages in the following paragraphs. 

9.2 Reservation Schemes 

Resource reservation is a mechanism conducted after a routing decision 
is made for a connection request in the path selection process. Signaling is 
issued to set up a bandwidth-guaranteed light path for the connection 
request. Communication sessions using Circuit- or Burst-Switching 
technique are typically subject to the course of accurate reservation of 
bandwidth along selected paths through extensive signaling mechanisms, 
while the Packet-Switching technique supports statistical multiplexing and 
can resolve the resource contention through some other approaches (such as 
deflection or any dynamic reconfiguration scheme). The way of reserving 
network resources differentiates the three switching types. 

This section introduces resource reservation schemes for the three 
switching paradigms in the optical domain: Lambda-Switching (or circuit- 
switching), Optical Packet Switching, and Optical Burst Switching. Our 
attention is focused on the efficiency of reservation and signaling 
mechanisms for each switching type. Without loss of generality, the 
following discussions will be based on the common assumptions and 
notations listed below. 

1. The propagation delay for each wavelength link is identical (i.e., the fiber 
links are of the same lengths). 

2. The processing delay of control packets (i.e., setup and release) at all 
nodes (source, intermediate and destination switches) are identical. 

3. The time durations (or parameters) of interest in the course of data-burst 
transmission are listed as follows: the source setup time (abbreviated as 
SST, which is defined as the time duration for a data-burst from the 
instant being present at the source node to the instant that the first bit of 
the data-burst is sent), the channel holding time for a successful data- 
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burst transmission (i.e., the time duration from the instant a channel is 
reserved to the instant the chaimel is released), and the end-to-end latency 
(abbreviated as End2End, which is the duration from the instant the data- 
burst presents itself for transmission at the source until it is completely 
accepted at the destination). SST strongly depends on the number of 
intermediate nodes along the path and the type of reservation adopted. 

4. Associated notations are as follows: 

tf : propagation delay from an access station to its attached WDM switch. 

tp : protocol message processing time at each node (intermediate WDM 
switches, source, and destination). 

tc : cross-connect cut-through switching time at a WDM switch (or a run- 
length inspection time). 

ti : propagation delay on a fiber link between WDM switches. 
tb : data-burst transmission duration. 

The typical values of the above parameters for current generation of 
WDM switches are: fiber delay (ti) of about 5 micro-seconds per kilometer, 
signaling protocol processing time (tp) of a few micro-seconds, and cross- 
connect cut-through time (Q of a few milliseconds. If store-and-forward 
processing is adopted, the total signaling time has to include not only the 
propagation delay, but also the processing time by which the control packet 
is processed and inspected during the confirmation [6]. 

9.2.1 Resource Reservation in Circuit-Switching and Packet- 
Switching Networks 

We will first take a look at the resource reservation schemes for Circuit- 
Switching networks Fig. 9.2 shows the time diagram for the Circuit- 
Switching sign a l in g mechanisms. The cut-through mechanism can save 
some processing time by forwarding the control packet directly in the data- 
link layer after it is read by a node. However, the round-trip propagation 
delay incurred by the control packet dominates the setup time and impairs 
network throughput, especially when the connection call is short and bursty. 
In this case, a very high percentage of total transmission time (i.e., setup 
time and data transmission time) may be consumed in the traveling of 
control packets. This is also called a two-pass reservation, in which the 
control packet is required to travel back-and-forth between source and 
destination for setting up a circuit. 
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Fig. 9.2, Time diagram for resource reservation in Circuit-Switching 
signaling mechanisms. 

Resource reservation with Packet-Switching is similarly diagramed in 
Fig. 9.3. Unlike the Circuit-Switching case, Packet-Switching is not subject 
to any signaling process or propagation latencies. While a header is 
processed in a node, the payload is buffered and has to wait for the end of 
inspection. Therefore, the header can “cut-through” an intermediate node 
only if a layer-2 interface and forwarding protocol are adopted (e.g., label 
switching or cell switching). 
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Source Intermediate Destination 




Fig. 9.3. Time diagram for resource reservation in packet-switching 
signaling mechanisms. 

In the case of packet-switching, the source setup time is simply the source 
setup processing time (SST): = t^ , and the end-to-end latency: 

=2-^/ +(«-!)•?/ +(n + 2)-fp+n-r,+f, (9.1) 

The term 2 • allows the whole packet (header and payload) to be 
propagated from the source node to the core network, and from the core 
network to the destination node. The second term (n-l)-r, is the 
propagation delay for the packet as transmitted between nodes. The third and 
fourth terms represent the processing latency and cut-though latency along 
the path, respectively, hi the case of circuit-switching, the source setup time 
(SST) is given by the whole round-trip traveling time of the control packet, 
which is: 

= 4 • + 2 • (n - 1) • + (2 • n + 3) • + n • (9.2) 

In Eq. (9.2), the term 2-tj. is required because a control packet and its 
data-burst access 4 times at the source and destination nodes during a whole 
session. The term 2 • (n - 1) ■ stands for the round-trip delay of the control 
packet which traverses 2 (n-l) links. The term (2 • n + 3) • tp depicts the 
protocol processing time consumed by the control packet at each node. The 
last term of the expression assumes that every node traversed by the control 
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packet contributes a cut-through switching time to the whole latency. The 
end-to-end latency for the circuit-switching is: 

7’^2L=6-t/+3-(n-l)-t,+(2-n + 3)-r^+n-r,+r, (9.3) 

In comparing Eq. (9.1) and Eq. (9.3), it is clear that Packet-Switching can 
achieve shorter end-to-end latency with the same data-burst due to the 
longest SST required by circuit switching. If the payload is short in length 
(i.e., tb is small), then Circuit-Switching is subject to a significant overhead. 
The expense paid by Packet-Switching in efficiency of resource reservation 
results from the necessity of buffering the payload, the requirement of very 
high speed header processing, and an instant 0/E conversion. 

9.2.2 Resource Reservation in Optical Burst Switching (OBS) 

Optical Packet Switching incurs implementation difficulties when 
handling all-optical processing and buffering requests. However, Circuit- 
Switching (or wavelength-routed) is subject to inefficiency in reserving 
network resources when connection calls are short and bursty, where the 
overhead of setting up and releasing network resources may dominate the 
total end-to-end latency. The above two phenomena can be improved with 
the adoption of the OBS switching paradigm. OBS is characterized by its 
highly coordinated out-of-band signaling mechanisms that facilitate All- 
Optical transportation of data-bursts. A one-pass reservation scheme saves 
the signaling on-the-fly latency for achieving the maximum extent of 
provisioning dynamicity. 

The design objectives of OBS signaling are as follows. First, the protocol 
should support unidirectional point-to-point data transmission, where clients 
have the capability to originate and receive a point-to-point optical data- 
burst dynamically. Second, the protocol should support different classes of 
data-burst in the optical domain. Retransmission due to a blocking is 
allowed, but must be as rare as possible for critical data-bursts. Third, the 
protocol should support packet-type services (or bursty transmissions), as 
well as conventional circuit-switched services. The exact mode of operation 
(i.e., the type and class of service) should be explicitly decided by the 
senders. Fourth, the protocol should be independent from the routing and 
wavelength assignment scheme adopted, so that it can be accommodated to 
different network environments. Furthermore, the reservation scheme should 
be able to assign differentiated levels priority to data-bursts with different 
QoS requirements [5]. 

Two resource reservation schemes have been documented for OBS, 
called Just-in-Time (JIT) and Just Enough Time (JET). The following sub- 
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sections will present signaling protocols designed for the two OBS schemes 
in WDM optical networks. 

9.2.2.1 Just-In-Time (JIT) 

Just-in-time OBS is a resource reservation protocol with explicit setup 
and explicit release mechanisms using specific control packets, which first 
emerged firom the U.S. Governments Multi-Wavelength Optical Networking 
(MONET) Project in 1999 [5]. The setup and release of network resources 
are termed explicit for the following reason: The elements inside a switch are 
configured for an incoming data-burst immediately after the arrival of a 
control packet, and will not be tom down until another control packet for 
releasing the connection arrives. The control packet to release the network 
resources is also called an in-band terminator (IBT). Contrary to explicit 
setup and release, the reservation of wavelength channels can be made upon 
a specific duration of time by some calculation in advance, which is termed 
estimated setup and estimated release [8]. 

The time diagram for the resource reservation protocol with explicit setup 
and release is shown in Fig. 9.4. The resources (i.e., switching fabrics and 
wavelength converters, etc.) of the intermediate and destination nodes are 
reserved inunediately before the burst arrives, which is where the name 
“just-in-time” comes from. Since optical flows are faster than the control 
packets that are stored-and-forwarded by every intermediate node, the 
optical data-burst should be sent some time after the control packet is sent, 
so that the wavelength channel is set up before the burst arrives. If a burst 
arrives at a node before the node is configured, the burst will be discarded, 
which is also termed an early arrival phenomenon and will impair 
throughput and provisioning dynamics. But if the node is configured too 
much before the data-burst arrives, the network resources are also wasted 
since no other data-bursts can reserve the time slot on the node. Therefore, a 
data-burst should be carefully delayed by a certain amount of time after 
sending the initial control packet. The calculation of the time interval should 
take the number of intermediate nodes along the selected path and the 
processing time of the control packet by each of the intermediate nodes 
(which will be demonstrated in the following paragraph). Since the burst is 
sent without waiting for a round-trip acknowledgement, the scheme is also 
called a one-pass reservation. After sending the data-bmst, the source node 
then sends another control packet to release the reserved wavelength channel 
and switching fabrics of the nodes. 

In the case of JIT OBS, the SST (i.e., source setup time) is 
(n + 2) • tp + 1 j , where tg>t^-tp. tg is the extra delay adjustment required 

to assure the completion of cut-through of the control packet at the last 
WDM switch [6]. The JIT uses the one-pass reservation, in which a data- 
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burst is sent without waiting for any acknowledgement. Note that a blocking 
may occur if the control packet fails to reserve the network resources in any 
intermediate node. In this case, the corresponding data-burst will still be sent 
after an interval equal to SST. But the data-burst will be discarded midway, 
as shown in Fig. 9.5. The intermediate node where the control packet failed 
to reserve a wavelength channel sends back another control packet for 
releasing the network resources that have been reserved for the data-burst 
transmission. The resumption of the data-burst transmission will not begin 
until the source node receives an acknowledgement from the destination 
node (i.e., a round-trip delay will be incurred for any reservation failure, 
which is equivalent to the delay caused by a reservation failure in the circuit- 
switching case). The blocking probability of transmitting data-bursts is an 
important performance index for OBS, which relates to the throughput and 
provisioning dynamics of the network. The blocking probability can be 
improved by adopting FDLs in each node so that the data-bursts subject to 
contention can be temporarily buffered for several tens of nanoseconds to 
resolve the contention. The node architecture equipped with FDLs and its 
advantages will be discussed later. 



Source Intermediate Destination 




Fig. 9.4. Just-in-time signaling mechanism. 
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Fig. 9.5. The control packet failed to make a reservation on the second 
intermediate node. 

Instead of sending a control packet to release the network resources at the 
end of data transmission, JIT can also alternatively set up the reservation 
duration in each intermediate node by estimating the time consumed by the 
data-burst to traverse through each node. Then, if the reservation duration is 
correct, each intermediate node automatically releases the switching fabrics 
reserved by the data-burst the instant the data-burst has entirely passed the 
node. Alternatively, limited time duration before a reserved wavelength 
channel is released can simply be defined for each node. In other words, the 
wavelength channels taken by a data-burst are automatically released after 
pre-defined time duration of reservation. In general, the derivation of the 
time duration should either consider the size of data-bursts, or the diameter 
of the network - an issue that needs to be well-elaborated to achieve better 
resource utilization. The above two alternatives are also called estimated 
reservations. Since each data-burst is variable in length, setting up a long 
reservation time may lead to a waste if the lengths of most data-bursts are 
considerably shorter. However, setting up short reservation durations can 
nonetheless limit the size of data-bursts to be launched into the network, and 
also increase the overhead of transmitting each data-burst. 

9.2.2.2 Channel Scheduling in JIT OBS 

One of the main concerns of resource management in OBS is to scheme 
each wavelength channel in an efficient way. Channel scheduling schemes 
are devised to diminish contention when more than one data-burst intend to 
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reserve an input/output port, switching fabric, or Fiber Delay Line (FDL), so 
that the data-burst dropping rate can be reduced. Here the FDL is taken as an 
optical component which can delay/buffer an early-arrived data-burst for a 
specific duration of time, most often less than 10 ns. Although the use of 
FDLs can improve network resource utilization and decrease data-burst 
dropping rates, we need a well-designed data channel scheduling algorithm 
to make the most use of those expensive network components, where optical 
fabrics and FDLs in a node can be allocated to different connection requests. 
The design of a data channel scheduling algorithm is always a challenge, as 
they must determine whether a data-burst should be immediately assigned a 
wavelength channel, or delayed/buffered in a node, or directly discarded as a 
contention occurs. The most notable difficulty lies in the fact that the 
algorithm should be very time-efficient for an on-line decision-making while 
achieving an effective resource allocation/scheduling in a node for 
minimizing the fragmentation of wavelength channel availability in the time 
domain. One of the earliest and most common data channel scheduling 
algorithms is the Horizon algorithm [4], which is very similar to the Latest 
Available Unscheduled Channel (LAUC) algorithm that emerged later [1]. 
In general, a data channel scheduling unit (or Switch Control Unit (SCU) in 
the following context) has to keep track of all the unscheduled time slots of 
each outgoing data channel (or wavelength). The basic idea of the Horizon 
algorithm is to minimize the fragmentation of time slots by selecting the 
latest available unscheduled data channel for the arrival of each connection 
request. Given an exact arrival time t„ and a burst with duration tb (in us) at a 
node, the SCU first tries to find all the available outgoing data channels that 
have not been scheduled between time ta and tg + h. If there is at least one 
such channel, the algorithm selects the latest available channel (or with the 
smallest time gap between the last unscheduled time and t^. The SCU then 
updates the available time of the selected channel to tg + h. For exan^le, in 
Fig. 9.5(a), the moment the control packet arrives at the node, data channel 
and D 2 are available at tg. The Horizon algorithm selects D 3 to carry the 
arriving data-bmst because t^-t 2 <t„-tj. 

The above descriptions do not consider the use of FDLs. In the event that 
FDLs are adopted, channel scheduling would be performed in a much more 
flexible fashion. Suppose an optical buffer has B FDLs, with the jth FDL 
able to delay j units, where j = 1, 2, ..., B, and a unit of FDL is d iw. If all 
data channels are not available between time tg to tg + tb, the arriving data- 
burst has to be delayed by a multiple of FDL units (e.g., i units), to find at 
least one unscheduled channel. If 1 < i < B , the scheduler will select the 
latest available channel (i.e., the smallest i) for the resource reservation, and 
update the unscheduled time of the channel to t + i • d + t^. ]f a. free time 
slot in i time units equal to or less than B cannot be found, the reservation 
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request of the control packet will be rejected at this node, and the following 
arrived data-burst will be discarded. Note that, as shown in Fig. 9.6(a), the i 
= 0 case is degraded to the case where no FDL is needed. Fig. 9.6(b) shows 
where FDLs are necessary for avoiding an immediate drop of a data-burst. 
At time and t^+d , all the channels are busy. The scheduler finds that D 2 
at the time t^+2-d is available, and then allocates the resource from 
tg+2-d to tg + 2- d + ti, foT the reservation request. 



Time 




(a) 



Time 




(b) 



Fig. 9.6. Illustration of Horizon (orLAUC) algorithm; (a) D2 is selected; 
(b)D3 is selected. 

9.2.2.3 Just-Enough-Time (JET) 

jrr is simple and straightforward in the explicit setup and release of 
network resources by control packets. We can further elaborate the scheme 
by designing the offset time between the instants of sending a control packet 
and the corresponding data-burst. Just-Enough-Time (JET) was first 
proposed in [2], where an estimated setup and release for each data-burst 
transmission are defined. In other words, an exact time period for 
wavelength channels reserved along a working path is calculated in the 
source node, and then assigned to each intermediate node by a control 
packet. The time diagram is illustrated in Fig. 9.7. 
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Source Intermediate Destination 




Fig. 9. 7. Time diagram for the JET reservation scheme. 

In JET, the SST is the same as in JIT, except that each data-burst is sent 
with a specific offset time (i.e., 6^ in Fig. 9.7) after the control packet is sent, 

H 

and the offset time SST can be equal to or larger than ^t^(h), where H is 

A=1 

the hop count along the path, and tp{h) is the protocol processing time on the 

H 

h-th node along the path. ^t^{h) is also called base offset time, which is 

h=\ 

the minimum amount of time the data-burst has to be delayed in the source 
node after the control packet is sent. 

JET has the following several characteristics that distinguish it from 
Packet-Switching and JIT-based OBS. With JET, since the data-bursts can 
be buffered in the source node, no FDLs are necessary at any intermediate 
node to delay the bursts while the control packets are being processed 
(although the use of FDLs can improve the blocking performance of JET 
OBS significantly). In addition, JET OBS can achieve more efficient 
bandwidth utilization by using delayed reservation, where the switching 
fabric of the ith node along the path is reserved since the arrival of the data- 
burst, instead of since the time the control packet is processed. This is also 
called an estimated reservation. Besides, instead of sending a control packet 
to release the reserved network resomrces along the path as the mechanism 
used by JIT, JET-based OBS specifies the exact time duration between the 
arrival and the departure of a data-burst in each intermediate node. The 
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estimation of reservation time is exemplified in Fig. 9.5. A data-burst arrives 
at the intermediate node at tp +^2 control packet’s 

arrival, and completely passes through the node at ip +^2 + ^6 
control packet’s arrival, where S 2 =6j-tp • The data-burst will arrive 
at the destination at tp +3^ time units after the arrival of the control packet. 
The destination will completely receive the data-burst at tp+S^+t^, where 
6j=Si-2-tp-ti (1) - ti (2) . This can be generalized to a path with H nodes, 

which gives the start and end time of the data-burst passing through the node 
relative to the arrival of the control packet: (n) = tp+3„ and 

.where S„ =3,-(n-l)-fp-^ti(i) for n = 2~H. 

i=\ 



9.2.2.4 QoS Support with JET OBS 

JET-based OBS is a resource reservation protocol with an estimated 
setup and release for a data-burst in each intermediate node along its path. 
Compared with the JIT OBS where the data channel scheduling is on a first- 
come-first-serve basis, JET provides more flexibility in managing the 
resource allocation. The flexibility is gained at the expense of more 
computation efforts, and taking up more buffering space at ingress nodes. 
Both JIT and JET OBS keep most buffering efforts (i.e., data-bursts buffered 
at ingress for the offset time) and electronic processing (i.e., traffic shaping 
and QoS scheduling, etc.) at the ingress nodes, while leaving the data-bursts 
transparent to the intermediate nodes. However, JET OBS can further isolate 
multiple classes of data-bursts with different QoS requirements by assigning 
different offset times. This is also called an offset-time-based QoS scheme 
and is one of the most notable merits in the JET OBS. By manipulating the 
offset time, the classes of service can be differentiated. The following 
paragraphs will discuss this mechanism. 

We have learned that a data-burst can be a collection of IP packets with 
the same destination and similar QoS requirements. JET-based OBS 
provides a basic QoS support for the IP over WDM networks in the optical 
layer, where differentiated classes of service are defined for data-bursts with 
different offset times. The delayed reservation equips the JET OBS with the 
ability to accommodate multi-service network environments and flexibility 
in resource reservation. Note that a data-burst can be sent at a minimum 
offset such that the data-burst will not catch up with the slower control 
packet traveling ahead, and will remain subject to store-and-forward 
processing at every intermediate node. The offset time is termed the base- 
offset, below which the data-burst may be discarded as it may arrive at a 
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node before the arrival of the corresponding control packet during the 
transmission. To show the class isolation mechanism, let t[ , and be 

the arrival time of the control packet, the service starting time of the data- 
burst, and the offset time for a reservation request req{i) at class i, 
respectively. Also let t[ be the burst length requested by req(j), where i = 0, 

1. We will see the reasons for that a class 1 request assigned with an extra 
offset time obtains a higher priority for resource reservation than a class 0 
request. It is clear that in an intermediate node, at any instant, a request of 
class 1 competes with only the other class 1 requests for resource 
reservation, while a class 0 request has to compete with not only the other 
class 0 requests, but also some class 1 requests that were issued beforehand. 
Fig. 9.8 illustrates the situation where a req(0) connection request is blocked 
due to contention. In (a), the control packet req{0) arrives at the node at 
r” for reserving time units of network resource. To successfully reserve 

enough resources for this data-burst, the time slot between (or 

and should be available. Therefore, a contention occurs in the 

following situations if some other control packets have arrived at the node 
beforehand to reserve a time slot: 

1. For a req(O) control packet that arrived at the node at , where , 
contention occiurs if <t^ , where t° is the burst length of the 

reg(0) connection request. As shown in Fig. 9.8(b), a region is defined 
along the time line for the arrival time of any class 0 request (i.e., req(Q) 
arrives at time ) such that a contention occurs with the req(0) (termed a 
region of contention for a specific control packet and offset time). For a 
req(l) control packet that arrived at the node at Fj , where i] , 

contention occurs if f ° 1 > f j - 4,,, - F* . 

As shown in Fig. 9.8(c), there is a range for the arrival time of req(l) (i.e., 
Fj ) that yields a contention with req(0) , and is marked along the time 
line. Note that the region of contention in this case is devised to facilitate a 
node to check whether or not a request message arriving at time can 

reserve the time slot from F® + to F® + T^^ei along the time line. 
We will see the case of a class 1 request req(l). 
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Fig. 9.8. An illustration of the region of contention for a class 0 request 
req{0) with the requests req(0)andreq(l). 

2. Fig. 9.9, illustrates how a req(l) connection request is blocked. In this 
case, any re^(0) connection request with <r* will never have a chance 

to content with the req{l) request due to the extra offset time . 

Only the other class 1 connection request, re^(l), with Fj <r‘ can possibly 
content with req(l). As shown in Fig. 9.9(b), the region of contention for a 
connection request req(l) is located on -tl < Fj < . 
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Fig. 9.9. An illustration of the range of contention for a class 1 request 
req{\) with only req{\) . 
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With the region of contention for a connection request, we can calculate 
blocking probability for the connection request at any node if the arrival and 
departure rates of data-bursts are available. In Fig. 9.8, the request reqiO) is 
blocked at a node if at least one request reg(0) arriving during the region of 

contention for class 0 (from time t° - to ) or at least one request reg(T) 
arriving during the region of contention for class 1 (from time 
~ ~ ^6 tO ^ *e arrival Of 

data-bursts follows a Poisson process, the blocking probability for any 
connection request at a node is l-exp(-/l„ •F“)-exp[-/lj -(F^ +t“)] , 
where \ and is the arrival rate of class 0 and class 1 data-bursts, 

respectively. With the same approach, we can derive the blocking 
probability in the case of a req(l) request. 

Conq)aring the above two cases, it is clear that the connection request 
with the extra offset time (i.e., req{l)) yields a smaller chance of blocking 

than does the case of req(0) if the traffic is uniform for both classes of 
request. With the difference, classes of service can be isolated. 

The above examples are based on the assumption that there is no FDL in 
each node (or no buffering provided). The offset-time-based QoS scheme 
can yield significantly better performance and system flexibility even with 
very limited FDLs equipped to resolve contention for bandwidth among 
multiple data-bursts. Let B denote the maximum delay an FDL can offer (or 
the longest FDL), which leads to a fact that a data-burst can be buffered up 
to a maximum B time unit. The region of contention can be much relaxed 
with the adoption of the DFLs. In Fig. 9.10(a), when reqifi) arrives (at t°) 
the wavelength channel is in use by another data-burst. Thus, the data-burst 
corresponding to req(0) has to be delayed for tj time units by an FDL. Note 

that the value of is ranged from 0 to B, and an FDL with an appropriate 
length that can provide a delay of tj will be chosen. Therefore, the region of 

contention for the req(0) request can be largely reduced, or even eliminated, 
if the length of the data-bursts is well controlled. If a req(l) request arrives, 
as shown in Fig. 9.10(b), an FDL is not required because of the extra offset 
time t' , . The difference between the reservation of the two classes is that 

the req(l) requests do not rely on FDLs to resolve contention as much as do 
req(0) requests. When there is a contention on FDLs, the req(0) requests will 
be subject to a higher blocking probability than req{V) requests. 
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Fig. 9.10. An illustration of class isolation between class 0 and class 1 
data-bursts with Fiber Delay Lines. 

9.3 Node Architecture 

In previous sections we introduced signaling mechanisms, data channel 
scheduling schemes for the JIT OBS, and the principles in QoS support for 
the JET OBS. In this section we will further discuss switching architectures 
for implementing OBS. Emphasis will be put on devises and design for 
contention resolution, such as the deployment of Fiber Delay Lines (FDLs). 

Basically, optical flows on an input fiber that enters a WDM switch are 
first demultiplexed along its frequency spectrum into independent per- 
wavelength flows. Then the per-wavelength flows with a same wavelength 
are sent to a right lambda-switching box (i.e., a lambda-switching box with 
the same wavelength) for space switching. After the space switching, the 
per-wavelength flows are sent to a right output port, and multiplexed by an 
optical multiplexer with the other per-wavelength flows of different 
wavelengths before leaving the node. The switch architecture of this process 
is shown in Fig. 9.11. The electronic switching box connects to the optical 
switching box through a transceiver/receiver array, and can either add/drop 
traffic data, or provide O/E/0 transformation for wavelength conversion or 
electronic buffering. Since the electronic buffering and O/E/0 
transformation are expensive and slow, the design of network resource 
allocation should avoid using O/E/0 conversion as much as possible. The 
use of FDLs can solve this problem. They are basically a segment of fiber 
with a specific physical length. If a per-wavelength flow is switched to an 
FDL it is delayed because extra physical distance is traveled. The FDLs are 
mainly used to resolve contention between different data-bursts that are sent 
to the same output port at a same time. 

Two switch architectures with FDLs are shown in Fig. 9.12. In Fig. 
9.12(a), the switch devise was proposed as part of the Wavelength Switch 
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Optical Packet Network Project [9]. The &st N input ports to the switching 
fabrics buffer the input optical data-bursts. The (N+l)-th to the (2N)-tft input 
ports are used for space switching. In Fig. 9.12(b), the switch architectiure 
was developed part of KEOPS projects [10]. Each incoming and outgoing 
port is intended to carry a single wavelength. The switch consists of two 
functional stages: contention resolution and space switching. Although the 
equipment of FDLs is also expensive and not scalable due to its operation at 
a per-wavelength basis, it can serve a transitional technology in the 
migration from opaque to all-optical networks. 




Fig. 9.11. Traditional lambda-switched optical cross connect for 
wavelength-routed WDM networks 

Fig. 9.13 illustrates a WDM switch that behaves as an P core router with 
FDLs. Without loss of generality, each directional link is assumed to have a 
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total of K wavelength channels (note that a directional link may have 
multiple fibers), of which only one channel in the directional link is used as a 
control channel to carry burst header packets (BHPs), and the remaining ^-1 
channels are used as data channels to carry data-bursts. Data channels are 
connected to the optical switching boxes of specific wavelengths while the 
control channels are terminated at the switching control unit (SCU). The 
input FDL is used to delay the arriving data-bursts, thus allowing the SCU 
sufficient time to process the associated BHPs, or to delay the data-bursts 
until a desired channel is available. Data-bursts still stay in the optical 
domain while being buffered in the optical routers. The purpose of allocating 
FDLs in the optical routers is to resolve contentions on outgoing data 
channels. Note that if the in-box FDLs are not provided, contention can be 
resolved by using slower resource-consuming electronic buffering, which 
could, however, decrease the switching capacity of the optical routers. 

The main functions of the SCU are to process BHPs, schedule their 
corresponding data-bursts to pass through the optical switching matrix, and 
forward BHPs to their next hops. The forwarding of BHPs can either follow 
a predetermined route (i.e., connection-oriented mode) or perform a hop-by- 
hop path searching process (i.e., connectionless mode), depending on the 
routing scheme adopted. In order to minimize blocking probability for each 
data-burst (which can be caused either by the early data-burst arrival 
phenomenon, or by contention with other data-bursts), the input FDLs and 
in-box FDLs should be properly dimensioned, based on the BHP processing 
time in the SCU and specifications of the offset-time-based QoS scheme 
(e.g., the extra offset time defined). Note that the input FDLs are not 
necessary if the delayed reservation mechanism keeps every data-burst no 
early than the corresponding control packet. 




Fig. 9.12. Two architectures for a lambda-switching box with embedded 
fiber delay lines. 
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Fig. 9.13. The fiber delay line (FDL) can improve the switching 
capability of an OXC and decrease the number of transceiver/receivers. The 
resources for electronic buffering can be saved. 

9.4 Summary 

The architecture of IP over WDM will undoubtedly become the 
architecture of the next generation optical Internet. In the meantime, the 
elimination of any intermediate layer from the ciurently commercialized 
Internet, such as ATM or SONET, wordd cause a loss of some important 
functionality in provisioning different classes of service and dealing with 
bursty traffic. Optical Burst Switching (OBS) can solve the above problem 
by providing dynamic management of network resources and a scalable 
class-isolation scheme. This chapter introduced the OBS paradigm, 
including its control schemes and some implementations of switch 
architectures. We discussed two important reservation schemes, Just-In- 
Time (JIT) and Just-Enough-Time (JET), and Horizon, a data channel 
scheduling algorithm. The region of contention in a node was defined for a 
specific data-burst, and the offset-time-based QoS scheme of JET was also 
presented. At the end we discussed node architectures that support 
contention resolution in the optical domain. 

Table 9.1 shows a comparison between the three major switching 
schemes in terms of bandwidth utilization, latency (including setup and 
processing), implementation difficulty, and the adaptability to traffic 
variation. 
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As for bandwidth utilization, since OBS and Optical Packet Switching 
allow out-of-band and in-band statistical multiplexing between data streams 
from different input ports, the wavelength resources are utilized in a much 
more efficient way than the wavelength-routed case, especially when the 
connection calls are short in length. Wavelength-routed networks are largely 
outperformed by the other two switching types because it lacks the ability to 
adapt to variations in traffic pattern. However, the wavelength-routed 
networks are intrinsically easy to implement in comparison with the other 
switching architectures, due mainly to a lower requirement for the switching 
speed of optical fabrics and a relatively small amount of O/E/0 conversion. 

Optical Packet Switching and Optical Burst Switching are subject to a 
very small amount of setup latency. However, the latency for setting up a 
lightpath in conventional wavelength-routed networks is twice as long as 
that of the OBS due to the use of two-pass reservation in the former. The 
setup latency is highly related to the provisioning speed, and behaves as an 
in^ortant performance index in dynamic networks. Therefore, the 
wavelength-routed networks are intrinsically not as dynamic as the other two 
schemes. However, the wavelength-routed networks incur a minimum 
amount of overhead in the overall connection setup/tear-down if the 
connections stand for a longer period of time. The packet switching 
networks have to pay cell/header tax for each optical packet as an expense of 
their high adaptability. 

As discussed in Chapter 7, the most economic shared protection scheme 
with a global optimization in wavelength-routed networks may consume 
30% to 40% of extra network resources for spare capacity. However, in the 
packet/cell switching paradigm, the intrinsic robusmess comes from the 
distributed control and intelligence encoded into the optical packets 
themselves. In both the optical packet and burst switching paradigms, the 
protection paths can be scheduled more efficiently due to the ability of 
statistical multiplexing (i.e.. Packets from different input ports can get access 
to a channel statistically) and the flexibility in distributed buffering of the 
data packetsA)ursts. The task of load-balancing is also easier to achieve with 
finer granularity of traffic. 
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Questions: 

1. Describe the three switching paradigms in optical networks. 

2. Describe the three most distinguishable differences between Optical 
Packet Switching and Optical Burst Switching. 

3. Describe the function of fiber delay lines (FDLs), the purpose of using 
FDLs in optical packet networks, and the function of Optical Burst 
Switching networks. 

4. An end-to-end connection has 100 intermediate nodes. The propagation 
delay from access station to the attached WDM switch is 100 fls , the 
protocol message processing time is 10//S , the cross-connect cut-through 
time is 1 ms, and the propagation delay on a fiber link is 5 /As /km. We 
setup an OC-48 (2.5 Gbps) lightpath to transmit 5 GBytes data flow. 

(a) With the packet-switching technology, each optical packet has a 
length of 100 /AS . What is the total transmission time in dealing with 
this session? 

(b) With the burst-switching technology, each data-burst has a length of 
10 ms. What is the total transmission time? 

(c) With the Lambda-Switching technology, what is the total 
transmission time? 

5. With the JET OBS, an optical data-burst is sent through a pre-defined 
lightpath along a physical path with 5 hops. Two classes of service are 
provisioned; class 0 and class 1. With class 1, an extra amount of offset 
time (which is longer than the length of a data-burst) is imposed. Assume 
the arrival rate of class 0 and class 1 in each node and follows a 
Poisson process. 

(a) What is the blocking probability if the data-burst belongs to the class 
0 and 1, respectively? 

(b) The alternate path between the source and destination is shown in Fig. 
Q.5. What is the blocking probability? 




Fig. Q.5. 

6. What is the purpose of setting up a reservation scheme for an OBS 
network? What is delayed reservation? 

7. We are given a JET OBS network with two types of connection requests 
- class 1: req{\), and class 0: req(0). req(l) and req(0) has an offset time 




276 



^offset ^offiiet > respectively, between the arrival of control packet and 
the corresponding data-burst. req(l) has higher priority than req(0) by 
having . Assume > tgg^^, , where is the 

minimum offset for a data-burst to be transferred successfully. 

(a) What is the purpose of setting up the region of contention for a 
connection request? 

(b) In Fig. Q.7(a), a req{\) arrives at time to with an offset time and 

burst length tl . Is it possible for the req(l) to have contention with 
any class 0 request? 

(c) In Fig. Q.7(b), a req(0) arrives at time to with an offset time t°g^^, and 
burst length tl . 

(d) If the network has , repeat (a) and (b). 
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Fig. Q.7. 
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8. What is the main difference between Just-in-Time and Just-Enough-Time 
signaling protocols? 

9. Do you agree that with the JET signaling protocol, the total network 
throughput will be greater than that of networks using JIT reservation 
protocol? What is the reason? 

10. What are the implementation difficulties of using Optical Packet 
Switching? 

11. What are the benefits of using OBS as compared with the other two 
switching schemes? What are the expenses of using OBS architecture? 

12. Describe the Horizon algorithm for data channel scheduling. 

13. Describe two-pass reservation and one-pass reservation. 




211 



14. If a blocking occurs, a data-burst may be discarded midway in the 
transmission. Will it save in bandwidth if the data-burst was not sent until 
being acknowledged that the connection is established? Detail your 
analysis. 

15. Explain why data-bursts with an extra amount of offset time are subject 
to a smaller blocking probability than those without extra offset time. 

16. Explain why Lambda-Switching (or wavelength-routed) is more efficient 
when the calls are long in length. 

17. Describe the function of SCU. 
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LOOP-LESS K-SHORTEST PATHS ALGORITHM IN 
DIRECTED GRAPHS 



A.l Introduction 

The shortest path between two nodes is the least-cost route connecting 
the node pair given the metric (or cost) of each link. The cost of a route can 
simply be hop count, physical length, or any link-state that considers 
dynamic traffic along the links traversed by the route. The shortest path will 
generally be the preferred path between these two nodes as a coimection 
request arrives. However, in some situations the cost of a path cannot be 
derived by summing up the cost of each link along the path. Therefore, the 
path selection process has to enumerate a number of paths (or alternate 
paths) before a decision is made. Dealing with the non-linear effects of some 
physical impairments upon a lightpath, and employing routing strategy that 
considers traffic grooming and tunnel allocation are a couple of examples. 
To prepare alternate paths for each node pair, we need a tool that generates 
not only the shortest path, but also some non-shortest (or sub-optimal) paths. 
This task is called the ranking of K-shortest paths between a node pair. 

K-shortest paths ranking algorithms have been a research focus since the 
1950s, and are widely used in communication, operation research, computer 
science and transportation science. In this Appendix, an algorithm based on 
Dijkstra’s shortest path first algorithm will be introduced, called Yen’s 
algorithm [1]. 

Two paths may have the same cost with any pre-defmed link metric (e.g., 
hop count). In the following discussion, two or more paths are not allowed to 
have the same cost. This can be achieved by the Tiny Additional Cost (TAC) 
technique introduced in Chapter 8. 
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A.2 Yen’s Algorithm 

Given the shortest path between two nodes (which can be derived by 
Dijkstra’s shortest path first algorithm), the Yen’s algorithm takes every 
node along the shortest path (except the end node) and calculates another 
shortest path (spur) from each selected node to the end node. As shown in 
Fig. A. 1(a), the shortest path from A to F is through A-B-C-E-F., Dijkstra’s 
algorithm is performed between each of the nodes A, B, C, and E (termed a 
branching node for this shortest path) to the end node F (i.e., the node pair A 
to F, B to F, C to F and E to F). However, the link next to the branching node 
on the shortest path is taken away from the network topology before 
Dijkstra’s is performed. In other words, when performing Dijkstra’s 
algorithm across A and F, link A-B is taken away, as shown in Fig. A. 1(b). 
When performing Dijkstra’s algorithm across B to F, link B-C is taken away 
from the network topology, as shown in Fig. A. 1(c). Every time Dijkstra’s 
algorithm is invoked it yields a path segment (termed a spur). The second 
shortest path is generated by concatenating the spurs with the original path 
segment not used along the shortest path. For example, suppose the node 
pair B-F generates a spur. The spur concatenating with link A-B will 
generate the second shortest path, which is termed ranked. In this case, link 
A-B is also called a root. 
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link 



(c) 



Fig. A.I. An illustration of Yen’s algorithm. 
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Yen’s algorithm iteratively calculates the newly generated paths for each 
path that has already been ranked, and also ranks the newly generated paths 
until enough k-shortest paths are ranked. 

Note that the spur of a ranked path has the following restrictions: 

1. It must not pass through its root. For example, in Fig. A. 1(c), the spur 
should not pass through link A-B to keep the generated paths loop-less. 

2. The spur should not pass through the forbidden link as shown in Fig. 
A. 1(b) and Fig. A. 1(c), which is the very next link downstream from the 
branching node. 

The proof of the Yen’s algorithm can be seen in [1]. In Yen’s algorithm, 
the spurs are calculated from each node 0(n) on the previous k-shortest path 
in turn, using the shortest path algorithm 0(n • log n) with a special heap 
data structure. Therefore, the total computation complexity is 0{n^ 'logn) 
for each path. There are several variations to Yen’s algorithm. 

References 

[1] J. Y. Yen, “Finding the k shortest loopless paths in a network”, Management Science, vol. 
17, pp. 712-716, 1971. 
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MAXIMUM-FLOW ALGORITHM 

The algorithm for solving the maximum flow problem (or a combination 
of paths connection from source to sink with the maximum bandwidth) was 
developed by Ford and Fulkerson. 

Given a connected graph G = (V, E), a capacity function of c, a flow f, 
and two nodes s and t, we define the residual capacity of an unidirectional 
arc (v, w), as (v, w) = c(v, w) - /(v, w) . The residual capacity of an arc 

is defined as the extra amount of flow that can be effectively pushed along 
this arc. Note that the flow is directional and can be negative. A flow of the 
amount / from w to v is equivalently in the residual graph that an extra 
capacity /from v to w exists to cancel the original flow. We say that an arc is 
saturated if its residual capacity is zero. Given a path P from s to t, the 
residual capacity of the path is the minimum of the residual capacities of its 
arcs, r. (P) = min r- (v, w) . We also define the residual graph of G with 

•' {v,w)eP •' 

respect to flow / as the directed graph =(V,Ry), where Rf is the set of 

unsaturated arcs, each with its residual capacity. 

Fig. B.l shows an example of the definition of the residual graph. The 
transformation between a graph to its residual graph can be described as 
follows: The arc A-B with capacity/fiow = 7/3 from A to B as shown in Fig. 
B.l(a), will yield two directional links with a capacity 4 and 3, respectively, 
in different directions. The link from B to A with capacity 3 results from the 
fact that the flow from A to B is 3, so there is a capacity from B to A of 3 
that can be canceled out. When a path from 5 to Z) is augmented to the 
residual graph, the residual capacity of each arc is updated, which yields a 
new residual graph. 
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Fig. B.l. An example showing the derivation of a residual graph from its 
original graph, (a) The network showing capacity/flow; (b) The residual 
graph with residual capacity [/]. 

The algorithm initializes the flow / (v, w) 4 — 0 for all v and w, and 
iteratively performs the following steps: 

1. Find any augmenting path, such as a path P from 5 to Z) in G subject to 
the limitation that r (P) > 0. If such a path is found, use it to augment the 
flow; otherwise, stop the algorithm. 

2. For each arc (v, w)e P , let /(v, w) 4 - /(v, w) + (P) and let 

f(w, v) 4 - / (w, v) - ry (P) . 

3. Rebuild the residual graph 

It has been proven that the Ford-Fulkerson algorithm will converge and 
yield an optimal solution with rational arc capacity. This statement is 
intuitive since the total flow that can be launched into a network must be 
well bounded by the link capacity. After the derivation of maximum flow 
between a node-pair, we can derive the minimum cut using that information. 



ACRONYMS 



ACA: Asynchronous Criticality Avoidance 
ADR: Adaptive Dynamic Routing 
AP: Alternate Path 

APS: Automatic Protection Switching 

ASE: Amplifier Spontaneous Emission 

ATM: Asynchronous Transfer Mode 

BGP: Border Gateway Protocol 

BLSR: Bidirectional Line-Switched Ring 

CB-STA: Capacity-Balanced Static Tunnel Allocation 

CR»LDP: Constraint-based Label Distribution Protocol 

CSPF: Constraint-based Shortest Path First 

CTI: Control-to-Transport Interface 

DLCI: Data Link Connection Identifier 

DTA: Dynamic Tunnel Allocation 

DWDM: Dense Wavelength Division Multiplexing 

FDDI: Fiber Distributed Data Interface 

FDL: Fiber Delay Line 

FEC: Forward Equivalence Class 

FF: First-Fit wavelength assignment scheme 

FPLC: Fixed-Path Least-Congested 

FPLC: Fixed-Path Least-Congested Wavelength Assignment scheme 
FR: Fixed Routing wavelength assignment scheme 
FSC: Fiber-Switching Capable 
GMPLS: Generalized Multi-Protocol Label Switching 
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H-FAR: Heavy Fixed Alternate Routing 
IETF: Internet Engineering Task Force 
IGP: Interior Gateway Protocol 
ILP: Integer Linear Programming 
InP: Integer Programming 
IP: Internet Protocol 

IS-IS: Intermediate System-Intermediate System 

ISP: Internet Service Provider 

ITSA: Iterative Two-Step-Approach 

JET: Just-Enough-Time 

JIT: Just-In-Time 

L2SC: Layer-2 Switching Capable 

LAN: Local Area Network 

LAUC: Latest Available Unscheduled Channel 

LLR: Least-Loaded routing 

LP: Linear Programming 

LSC: Lambda-Switching Capable 

MAN: Metropolitan Area Network 

MCMF: Multi-Commodity Minimum Flow 

MG-OXC: Multi-Granularity Optical Cross Connect 

MIP: Mixed Integer Programming 

MIRA: Minimum Interference Routing Algorithm 

MPLS: Multi-Protocol Label Switching 

MS: Maximum Sum wavelength assignment scheme 

MU: Most-Used wavelength assignment scheme 

NEL: Normalized Extra Length (in hop count) 

NIS: Notification Indication Signal 

NMS: Network Management System 

NNI: Network-Network Interface 

OVE\0: Optical to Electronic to Optical conversion 
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0-APS: Optical Automatic Protection Switching Protocol 

OBS: Optical Burst Switching 

OIF: Optical Internetworking Forum 

OSPF: Open Shortest Path First 

p-cycle: Pre-configured Cycle 

PMD: Polarization Mode Dispersion 

PML: Path Merged Label Switched Router 

PSC: Packet-Switching Capable 

PSL: Path Switched Label Switched Router 

PVC: Permanent Virtual Circuit 

QIP: Quadrature Integer Programming 

RF: Random Fit wavelength assigmnent scheme 

RFC: Request for Conunent 

ROW: Right of Way 

RSVP: Resource Reservation Protocol 

RWA: Routing and Wavelength Assigmnent 

SCU: Switch Control Unit 

SHR: Self-Healing Ring 

SIF: SONET Interoperable Forum 

SLSP: Short Leap Shared Protection 

SNR: Signal to Noise Ratio 

SONET: Synchronous Optical Network 

SPAWG: Shortest Path Algorithm with Wavelength Graph 

TDM: Time Division Multiplexing 

TE: Traffic Engineering 

TLV: Type/LevePValue Triplet 

UNI: User-Network Interface 

UPSR: Unidirectional Path-Switched Ring 

UT: Utilization 

VCI: Virtual Circuit Identifier 
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VPI: Virtual Path Identifier 

WAN: Wide Area Network 

WDM: Wavelength Division Multiplexing 

WG: Wavelength Graph 

W-NLS: Weighted Network Link-State 

WSC: Waveband-Switching Capable 

WSP: Widest-Shortest Path First 




Symbols 



K Number of wavelength channels in a fiber 

F Number of fibers along a directional link 

A Arrival rate of connection requests 

// Departure rate of connection requests 

p Potential traffic load {Alp) 

NCk Number of connection requests between the kth S-D pair 

NEL Normalized Extra Length 

KPij K-shortest path set between node pair {i,j) 

NLij Number of lightpaths between node pair (i,j) 

Wa,b Weighted Network Link-state (which is also the potential 

traffic load) of link a-b 

W Average of Wa,b 

SD Set of S-D pairs in the network 

No_SD Number of S-D pairs in the networks 

My Set of alternate patii for S-D pair (i,j) 

Number of alternate paths for S-D pair (i,j) 

B Number of wavebands along a fiber 

B_oc Bandwidth of a connection (in unit of Optical Container) 

a Weighting parameter 




Threshold of criticality 
Number of network events 
Binary indicators 

Number of working paths in a subset 

Set of working paths existing in the network 

Number of working paths existing in the network 

Number of extra nodes added into the wavelength graph 

Set of edges in a network (non-directional) 

Number of edges in the network 

Set of links in a network (unidirectional) 

Number of directional links in the network 

Set of nodes in the network 

Number of nodes in the network 

Set of wavelength planes in the network 

Residual bandwidth (or number of free wavelength channels) 
along link a-b on wavelength plane k 




Glossary 



3R: Re-shaping, Re-timing and Re-generation of optical flows. 

Alternate path; one of the pre-defmed paths across a node-pair, along 
which the connection request of the node-pair may reserve a lightpath. 

Asynchronous transfer mode (ATM): A form of packet transmission and 
switching using cells (which are fixed-size packets). ATM is the data 
transfer interface for Broadband ISDN. 

Augmented model: Different routing and signaling instances are adopted by 
the IP and transport layers. However, unlike the overlay model, the two 
layers communicate by means of some information exchanges (or leakages) 
instead of interfaces or service entities. 

Blocking probability; a performance index by measuring the probability for 
a connection request to be blocked. 

Burstification: the approach of grouping IP packets into data-burst in OBS. 

Capacity-efficiency: qualitatively describe how efficient the algorithms 
utilize the network resources. 

Circuit switching: A method of communicating where a dedicated 
connection path is set up between two devices through one or more 
intermediate nodes. Data flows are sent as a continuous stream with a 
guaranteed bandwidth and propagation delay. 

Communication architecture: the hardware and software structure that 
implements the communication functions. 

Crosstalk: The phenomenon where a signal transmitted on one circuit or 
channel of a transmission system creates an undesired effect in another 
circuit or channel. 

Delayed reservation: with OBS, switching fabrics of the ith intermediate 
node along the selected path is reserved since the arrival of the data-burst 
instead of since the time the control packet is processed. 
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Dijkstra’s algorithm: the most commonly used adaptive routing scheme, 
which greedily finds the least-cost path between source and destination by 
relaxing each node iteratively with the least-cost from the source node. 

DXC: Digital Cross Connect, which is an electronic switch. 

Dynamic RWA process: consider only the traffic distribution in the 
network to perform routing and wavelength assignment; the coimection 
requests arrive at the network one after the other without any knowledge of 
future arrivals. 

Estimate reservation/release: the control packet of OBS sets up a time 
point for a node to reserve/release specific network resources (e.g., switching 
fabrics and wavelength converter). 

Explicit reservation/release: the resources in a node are reserved /released 
right after a corresponding control packet arrives. 

Fiber-Switching: optical flows in a fiber are switched as a whole in an 
OXC. 

Full-wavelength conversion: the ability for a node to exchange the optical 
flows between any two different wavelength planes. 

Full-wavelength convertible: no wavelength continuity constraint needs to 
be considered in the network. 

FXC: Fiber Cross Connect, which is a switch that switches fibers. 

G-LSP: Generalized Label Switched Path, which is a LSP in the domain 
with the GMPLS signaling. 

G-LSR: Generalized Label Switched Router, which is a router with MPLS 
control plane and GMPLS signaling. 

Header: system-defined control information that precedes user-data. 

Hypothetical path: a path between a node-pair along which the traffic of the 
node-pair is most likely issued. The path is defined to facilitate the 
calculation of Weighted Network Link-state (W-NLS). 

Integrated model: A single control plane is used to handle both IP and 
transport layers, where a single routing and signaling instance is adopted by 
the two layers. 




293 



Integrated service digital network (ISDN): a worldwide 

telecommunication service that uses digital transmission and switching 
technology to support voice and digital data communication. 

Inter-arrival planning: the planning efforts made during two consecutive 
network events. 

Lambda-Switching: the optical flows are switched on a per-unit- 
wavelength basis. 

Layer: a group of services, functions, and protocols that is complete from a 
conceptual point of view that is one out of a set of hierarchically arranged 
groups. 

Layer-2 Switching: Packets are switched according to layer-2 frame/cells. 

Link-based protection: every link and node along a working lightpath is 
protected by a corresponding protection path segment. 

Link-state: a general term for the network states, which can be the physical 
topology (e.g., connectivity, link capacity, etc), wavelength availability, and 
the attribute of an input/output port (e.g., granularity, bit-rate or some 
physical constraints). 

Local area network: A commimication network providing interconnection 
of a variety of data communicating devices within a small area. 

LoL: loss of light caused by link failure or path failure, which can be 
detected at any intermediate node of a lightpath. 

LoS: loss of signal at the end node caused by link degradation or path 
degradation. 

LSP: Label Switched Path, which is the route composed of LSR and links 
through which the route traverses. 

LSR: Label Switched Router, which is a router working in the MPLS 
domain, as defined in Section 3.3.2 in detail. 

M/M/K/K queue: a queue with input traffic following Poisson arrival and 
the departure defined in an exponential distribution function, where K 
servers and K accommodations are allocated. 

MG-OXC: an OXC that can provision multi-granularity traffic, including 
Lambda-Switching, Waveband-Switching, and Fiber Switching traffic in the 
optical domain. 
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Multiplexing: a function of transmission that permits two or more data 
sources to share a common transmission medium such that each data source 
has a virtual chaimel. 

Network event: an event that changes the network link-state; in this book, a 
network event is either a lightpath setup or tear-down request. 

Network resources: a general term for the capacity of network components 
that may be exhausted with reservation processes 

Noise: Unexpected signals that distort the signal intended for transmission 
and reception. 

No-wavelength convertible: the wavelength continuity constraint is always 
held in every node in the network. 

Offset-time-based QoS scheme: manipulating the amount of extra offset 
time to define services with different priority. This scheme can be used in 
JET OBS. 

OLXC: Optical Layer Cross Connect, which contains switching fabrics, 
optical components (e.g., demultiplexer/multiplexer, wavelength converters 
and optical amplifiers), and line cards, etc. 

Optical fiber: a thin filament of glass or other transparent material through 
which a signaling-encoded light bean is transmitted by means of total 
internal reflection. 

Overlay model: Use of different control planes on the IP and transport 
layers, where interfaces (User-Network Interface (UNI)) and service entities 
(e.g., some primitives defined across an UNI) are defined for the 
communication between the two layers. 

OXC: Optical Cross Coimect, which is the combination of an OLXC and its 
upper control unit, and is defined in Section 3.3.2. 

PACK: a wavelength assignment principle that tries to pack lightpaths on 
the same wavelength plane to reduce resource fragmentation. 

Partial-wavelength convertible: some nodes in the network cannot perform 
full-wavelength conversion. 

Path-based protection: a lightpath is protected by an end-to-end protection 
path. 
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Path-toiiented spare capacity allocation: Taking each working path 
existing in the network as a basis for performing spare capacity allocation. 

Peer model: routers (in the IP domain) and switches (in the transport plane) 
are taken as peer entities, where a single routing instance is disseminated in 
both domains. 

Protection: the suite of mechanisms performed before a failure occurs, such 
as the monitoring of optical flows and the pre-planning of spare capacity for 
a working path. 

Provisioning priority: the priority of a connection request for a lightpath to 
be set up. Provision priority determines whether or not a connection request 
will be settled if a contention occurs. 

Provisioning speed: qualitatively how fast a lightpath is provisioned after 
the corresponding lightpath setup request is issued. 

PXC: Photonic Cross Connect, which is different from an OXC in that a 
PXC is based on all-optical technology. 

Reachability: defined in the routing table of a border node, which specifies 
the route(s) to the other sub-areas of the Internet. 

Restoration time: the time period between the instant of the occurrence of a 
failure that interrupts a connection and the instant that the connection is 
recovered. 

Restoration: the suite of mechanisms taken to recover the affected traffic, 
such as the signaling efforts coordinated by , configination of switching 
fabrics along the intermediate nodes of the protection path, and traffic 
switchover at the source node. 

SLAs: Service Level Agreements, which are service contracts signed 
between ISP and its customers. 

Span-oriented spare capacity allocation: Taking the working capacity 
along each span as a basis for performing spare capacity allocation. 

SPREAD: a wavelength assignment principle that tries to homogeneously 
spread lightpaths on the same wavelength planes for load-balancing. 

SRLG: Shared Risk Link Group, which is a logical bundle of links that 
suffers the relatively same risk of any impairment or damage to the network. 
For example, the fibers bundled in a conduit may have a common SRLG 
since they share the same risk of being cut if the conduit is cut. 
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Static RWA process: consider full information of traffic matrix (or all 
possible connection requests) and network topology for performing routing 
and wavelength assignment. 

Statistical multi-trial approach: an approach to improve the performance a 
heuristic algorithm, where the input arguments are randomly shuffled in their 
sequences. 

TE TLV: Type/LengthA^alue triplet, which is an object contained in a LSA 
to indicate dynamic network conditions. The TE LSA has only one top-level 
TLV triplet and has one or more nested TLVs (or sub-TLVs) for 
extensibility. The top-level TLV can take one of two values: Router Address 
or Link. The sub-TLVs for the Link TLV are enhanced in support of 
GMPLS. 

Throughput: a performance index derived by accumulating all the 
bandwidth in transmission in the network. 

Topology: the structure, consisting of paths and switches, that provides the 
communications interconnection among nodes of networks. 

TU: Transportation Unit, which represents the transport layer of a node with 
MPLS-based control plane, and is defined in Section 3.3.2. 

Waveband: a number of wavelength channels that can be switched together 
by an OXC; depending on technology, the wavelength channel in a 
waveband can be consecutive or non-consecutive in frequency. 

Waveband-Switching: the optical flows are switched on a per-waveband 
basis. 

Wavelength continuity constraint: a constraint stipulating that the optical 
flows on different wavelength planes cannot be exchanged in an all-optical 
manner, without a wavelength converter. 

Wavelength graph: a technique that models a multi-layered WDM network 
into a single-layered graph so that any routing algorithm can be performed. 

Wavelength plane: all the wavelength channels with the same wavelength 
range which are interchangeable. 
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